Overview

Dataset statistics

Number of variables6
Number of observations1984
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory102.8 KiB
Average record size in memory53.1 B

Variable types

Numeric5
Text1

Dataset

Description샘플 데이터
Author신한은행
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=320

Alerts

OBJECTID_1(OBJECTID_1) is highly overall correlated with Y중심좌표(Y_Center)High correlation
X중심좌표(X_Center) is highly overall correlated with 공간길이(SHAPE_LENG) and 1 other fieldsHigh correlation
Y중심좌표(Y_Center) is highly overall correlated with OBJECTID_1(OBJECTID_1)High correlation
공간길이(SHAPE_LENG) is highly overall correlated with X중심좌표(X_Center) and 1 other fieldsHigh correlation
공간면적(SHAPE_AREA) is highly overall correlated with X중심좌표(X_Center) and 1 other fieldsHigh correlation
OBJECTID_1(OBJECTID_1) has unique valuesUnique
그리드코드(GRID50_CD) has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:59:30.842842
Analysis finished2023-12-10 14:59:37.629460
Duration6.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

OBJECTID_1(OBJECTID_1)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1984
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74272.196
Minimum5418
Maximum241647
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-10T23:59:37.847918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5418
5-th percentile11724.3
Q124753.5
median46361
Q3119475
95-th percentile209714.3
Maximum241647
Range236229
Interquartile range (IQR)94721.5

Descriptive statistics

Standard deviation64439.208
Coefficient of variation (CV)0.86760875
Kurtosis-0.11336939
Mean74272.196
Median Absolute Deviation (MAD)26105.5
Skewness1.0802499
Sum1.4735604 × 108
Variance4.1524115 × 109
MonotonicityNot monotonic
2023-12-10T23:59:38.194287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5418 1
 
0.1%
13887 1
 
0.1%
34243 1
 
0.1%
31705 1
 
0.1%
28131 1
 
0.1%
26061 1
 
0.1%
24671 1
 
0.1%
23342 1
 
0.1%
22150 1
 
0.1%
21121 1
 
0.1%
Other values (1974) 1974
99.5%
ValueCountFrequency (%)
5418 1
0.1%
5424 1
0.1%
5427 1
0.1%
5545 1
0.1%
5551 1
0.1%
5553 1
0.1%
5555 1
0.1%
5560 1
0.1%
5563 1
0.1%
5566 1
0.1%
ValueCountFrequency (%)
241647 1
0.1%
241464 1
0.1%
241269 1
0.1%
240880 1
0.1%
240588 1
0.1%
239826 1
0.1%
239518 1
0.1%
239381 1
0.1%
239259 1
0.1%
237691 1
0.1%
Distinct1984
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2023-12-10T23:59:38.922588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters19840
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1984 ?
Unique (%)100.0%

Sample

1st rowGS00005418
2nd rowGS00031992
3rd rowGS00052227
4th rowGS00067035
5th rowGS00097864
ValueCountFrequency (%)
gs00005418 1
 
0.1%
gs00007513 1
 
0.1%
gs00031705 1
 
0.1%
gs00028131 1
 
0.1%
gs00026061 1
 
0.1%
gs00024671 1
 
0.1%
gs00023342 1
 
0.1%
gs00022150 1
 
0.1%
gs00021121 1
 
0.1%
gs00020161 1
 
0.1%
Other values (1974) 1974
99.5%
2023-12-10T23:59:39.904187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6274
31.6%
G 1984
 
10.0%
S 1984
 
10.0%
1 1622
 
8.2%
2 1254
 
6.3%
3 1191
 
6.0%
4 1148
 
5.8%
5 978
 
4.9%
7 867
 
4.4%
6 851
 
4.3%
Other values (2) 1687
 
8.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15872
80.0%
Uppercase Letter 3968
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6274
39.5%
1 1622
 
10.2%
2 1254
 
7.9%
3 1191
 
7.5%
4 1148
 
7.2%
5 978
 
6.2%
7 867
 
5.5%
6 851
 
5.4%
8 847
 
5.3%
9 840
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
G 1984
50.0%
S 1984
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15872
80.0%
Latin 3968
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6274
39.5%
1 1622
 
10.2%
2 1254
 
7.9%
3 1191
 
7.5%
4 1148
 
7.2%
5 978
 
6.2%
7 867
 
5.5%
6 851
 
5.4%
8 847
 
5.3%
9 840
 
5.3%
Latin
ValueCountFrequency (%)
G 1984
50.0%
S 1984
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19840
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6274
31.6%
G 1984
 
10.0%
S 1984
 
10.0%
1 1622
 
8.2%
2 1254
 
6.3%
3 1191
 
6.0%
4 1148
 
5.8%
5 978
 
4.9%
7 867
 
4.4%
6 851
 
4.3%
Other values (2) 1687
 
8.5%

X중심좌표(X_Center)
Real number (ℝ)

HIGH CORRELATION 

Distinct1930
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200383.96
Minimum179752
Maximum215862
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-10T23:59:40.230233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum179752
5-th percentile187167.5
Q1195216.25
median201345
Q3205887.25
95-th percentile209570.45
Maximum215862
Range36110
Interquartile range (IQR)10671

Descriptive statistics

Standard deviation6919.2007
Coefficient of variation (CV)0.034529714
Kurtosis-0.27655431
Mean200383.96
Median Absolute Deviation (MAD)5075.5
Skewness-0.50136917
Sum3.9756177 × 108
Variance47875338
MonotonicityNot monotonic
2023-12-10T23:59:40.530225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
198172 2
 
0.1%
193162 2
 
0.1%
197472 2
 
0.1%
204576 2
 
0.1%
198805 2
 
0.1%
199702 2
 
0.1%
192257 2
 
0.1%
208575 2
 
0.1%
204776 2
 
0.1%
202416 2
 
0.1%
Other values (1920) 1964
99.0%
ValueCountFrequency (%)
179752 1
0.1%
180795 1
0.1%
180799 1
0.1%
181002 1
0.1%
181104 1
0.1%
181608 1
0.1%
181637 1
0.1%
181809 1
0.1%
181810 1
0.1%
181837 1
0.1%
ValueCountFrequency (%)
215862 1
0.1%
215819 1
0.1%
215757 1
0.1%
215220 1
0.1%
215210 1
0.1%
215055 1
0.1%
215004 1
0.1%
214857 1
0.1%
214760 1
0.1%
214617 1
0.1%

Y중심좌표(Y_Center)
Real number (ℝ)

HIGH CORRELATION 

Distinct1631
Distinct (%)82.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean455025.76
Minimum438706
Maximum464461
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-10T23:59:40.929149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum438706
5-th percentile442667.2
Q1450139.25
median456951.5
Q3460006.25
95-th percentile462731
Maximum464461
Range25755
Interquartile range (IQR)9867

Descriptive statistics

Standard deviation6332.5253
Coefficient of variation (CV)0.01391685
Kurtosis-0.4256305
Mean455025.76
Median Absolute Deviation (MAD)3701
Skewness-0.77321136
Sum9.0277111 × 108
Variance40100877
MonotonicityNot monotonic
2023-12-10T23:59:41.294933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
464432 3
 
0.2%
459843 3
 
0.2%
456951 3
 
0.2%
460625 3
 
0.2%
457199 3
 
0.2%
457386 3
 
0.2%
457713 3
 
0.2%
459823 3
 
0.2%
458123 3
 
0.2%
457331 3
 
0.2%
Other values (1621) 1954
98.5%
ValueCountFrequency (%)
438706 1
0.1%
438765 1
0.1%
438797 1
0.1%
438822 1
0.1%
438972 1
0.1%
439106 1
0.1%
439108 1
0.1%
439153 1
0.1%
439215 1
0.1%
439446 1
0.1%
ValueCountFrequency (%)
464461 1
 
0.1%
464460 1
 
0.1%
464459 1
 
0.1%
464437 1
 
0.1%
464436 1
 
0.1%
464435 1
 
0.1%
464433 1
 
0.1%
464432 3
0.2%
464431 2
0.1%
464430 1
 
0.1%

공간길이(SHAPE_LENG)
Real number (ℝ)

HIGH CORRELATION 

Distinct1313
Distinct (%)66.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200.07531
Minimum200.07083
Maximum200.07863
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-10T23:59:41.732180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum200.07083
5-th percentile200.07243
Q1200.07424
median200.07554
Q3200.07644
95-th percentile200.07724
Maximum200.07863
Range0.007800952
Interquartile range (IQR)0.0022066298

Descriptive statistics

Standard deviation0.0015083557
Coefficient of variation (CV)7.5389396 × 10-6
Kurtosis-0.26970886
Mean200.07531
Median Absolute Deviation (MAD)0.001101051
Skewness-0.50182829
Sum396949.42
Variance2.2751369 × 10-6
MonotonicityNot monotonic
2023-12-10T23:59:42.076472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200.077237683 17
 
0.9%
200.073639863 10
 
0.5%
200.076841952 9
 
0.5%
200.075246239 9
 
0.5%
200.076441957 9
 
0.5%
200.0760377 8
 
0.4%
200.076841951 8
 
0.4%
200.075637703 8
 
0.4%
200.076837689 8
 
0.4%
200.075637706 8
 
0.4%
Other values (1303) 1890
95.3%
ValueCountFrequency (%)
200.070831388 1
0.1%
200.071031384 1
0.1%
200.071032447 1
0.1%
200.071129785 1
0.1%
200.071229252 1
0.1%
200.071229253 1
0.1%
200.071229254 1
0.1%
200.071230315 1
0.1%
200.071233508 1
0.1%
200.071233509 1
0.1%
ValueCountFrequency (%)
200.07863234 1
0.1%
200.078631276 2
0.1%
200.078629148 1
0.1%
200.078532873 1
0.1%
200.078531808 1
0.1%
200.078431279 2
0.1%
200.078429153 1
0.1%
200.07842915 1
0.1%
200.078427026 1
0.1%
200.078332875 1
0.1%

공간면적(SHAPE_AREA)
Real number (ℝ)

HIGH CORRELATION 

Distinct1333
Distinct (%)67.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2501.8832
Minimum2501.7711
Maximum2501.9662
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-10T23:59:42.984604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2501.7711
5-th percentile2501.811
Q12501.8562
median2501.8889
Q32501.9114
95-th percentile2501.9314
Maximum2501.9662
Range0.19509668
Interquartile range (IQR)0.055186525

Descriptive statistics

Standard deviation0.037723021
Coefficient of variation (CV)1.5077851 × 10-5
Kurtosis-0.26973135
Mean2501.8832
Median Absolute Deviation (MAD)0.027536595
Skewness-0.50181161
Sum4963736.2
Variance0.0014230263
MonotonicityNot monotonic
2023-12-10T23:59:43.476975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2501.93131493 17
 
0.9%
2501.84133551 10
 
0.5%
2501.92141783 9
 
0.5%
2501.91141414 9
 
0.5%
2501.88150985 9
 
0.5%
2501.91130756 8
 
0.4%
2501.92131123 8
 
0.4%
2501.89130014 8
 
0.4%
2501.89130023 8
 
0.4%
2501.92141781 8
 
0.4%
Other values (1323) 1890
95.3%
ValueCountFrequency (%)
2501.77109826 1
0.1%
2501.77609995 1
0.1%
2501.77612652 1
0.1%
2501.77856084 1
0.1%
2501.78104839 1
0.1%
2501.78104842 1
0.1%
2501.78104844 1
0.1%
2501.78107499 1
0.1%
2501.78115483 1
0.1%
2501.78115486 1
0.1%
ValueCountFrequency (%)
2501.96619494 1
0.1%
2501.96616833 2
0.1%
2501.96611509 1
0.1%
2501.96370728 1
0.1%
2501.96368065 1
0.1%
2501.96116645 2
0.1%
2501.96111326 1
0.1%
2501.96111319 1
0.1%
2501.96106007 1
0.1%
2501.95870538 1
0.1%

Interactions

2023-12-10T23:59:36.037635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:31.465289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:32.505260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:33.770856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:34.895221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:36.241490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:31.657306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:32.747755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:34.016176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:35.099795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:36.469966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:31.865142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:33.002426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:34.241706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:35.330843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:36.668638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:32.048706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:33.243629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:34.426416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:35.558279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:36.919359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:32.265668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:33.476105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:34.657185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:59:35.799203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:59:43.747256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
OBJECTID_1(OBJECTID_1)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
OBJECTID_1(OBJECTID_1)1.0000.5520.9910.5470.547
X중심좌표(X_Center)0.5521.0000.5690.9980.998
Y중심좌표(Y_Center)0.9910.5691.0000.5680.568
공간길이(SHAPE_LENG)0.5470.9980.5681.0001.000
공간면적(SHAPE_AREA)0.5470.9980.5681.0001.000
2023-12-10T23:59:43.962584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
OBJECTID_1(OBJECTID_1)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
OBJECTID_1(OBJECTID_1)1.000-0.171-1.000-0.192-0.192
X중심좌표(X_Center)-0.1711.0000.1800.9980.998
Y중심좌표(Y_Center)-1.0000.1801.0000.2010.201
공간길이(SHAPE_LENG)-0.1920.9980.2011.0001.000
공간면적(SHAPE_AREA)-0.1920.9980.2011.0001.000

Missing values

2023-12-10T23:59:37.242857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:59:37.487732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

OBJECTID_1(OBJECTID_1)그리드코드(GRID50_CD)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
05418GS00005418202136464459200.0759472501.89903
131992GS00031992206766459031200.076742501.918877
252227GS00052227198028456234200.0747372501.868779
367035GS00067035208441454538200.0770362501.92626
497864GS00097864185398451615200.0720322501.801135
5133249GS00133249184561449109200.0718282501.796027
6134667GS00134667192464449051200.0735292501.838555
7153977GS00153977211179447550200.0776272501.941052
8177763GS00177763204388445413200.0762242501.90596
9212313GS00212313203803442459200.0760222501.900905
OBJECTID_1(OBJECTID_1)그리드코드(GRID50_CD)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
197452182GS00052182195777456222200.0743372501.858775
197553167GS00053167205831456175200.0766372501.916283
19765827GS00005827204587464372200.0762462501.906519
19777653GS00007653206591463882200.0767462501.91901
197811927GS00011927199594462644200.0751452501.878969
197913934GS00013934204449462170200.0762432501.906439
198015627GS00015627202500461760200.0758432501.896435
198116852GS00016852204952461522200.0763442501.908953
198217945GS00017945198902461240200.0750422501.876401
198319125GS00019125207706461087200.0768432501.921444