Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 20000
Missing cells (%): 20.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 888.7 KiB
Average record size in memory: 91.0 B
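
The summary figures above can be cross-checked with simple arithmetic. The sketch below recomputes the missing-cell percentage and the average record size from the reported totals; the variable names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table of the report.
n_variables = 10
n_observations = 10_000
missing_cells = 20_000
total_size_kib = 888.7

# Share of missing cells: missing / (rows * columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes: total bytes / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The 20000 missing cells correspond to two fully empty columns (X좌표 and Y좌표) of 10000 rows each.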

Variable types

Numeric: 1
Text: 1
Categorical: 6
Unsupported: 2

Dataset

Description: 순번,ID,도시계획코드,분류명,조서ID,고시ID,라벨명,고시일자,X좌표,Y좌표
Author: 서울특별시
URL: https://data.seoul.go.kr/dataList/OA-15530/S/1/datasetView.do

Alerts

조서ID has constant value "" (Constant)
고시ID has constant value "" (Constant)
고시일자 has constant value "" (Constant)
라벨명 is highly overall correlated with 순번 and 2 other fields (High correlation)
분류명 is highly overall correlated with 순번 and 2 other fields (High correlation)
도시계획코드 is highly overall correlated with 순번 and 2 other fields (High correlation)
순번 is highly overall correlated with 도시계획코드 and 2 other fields (High correlation)
X좌표 has 10000 (100.0%) missing values (Missing)
Y좌표 has 10000 (100.0%) missing values (Missing)
순번 has unique values (Unique)
ID has unique values (Unique)
X좌표 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Y좌표 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
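
The Constant and Missing alerts above point at columns that carry no information. As a minimal sketch of how such columns could be detected before further analysis, the function below flags every column with a single distinct value. The function name `uninformative_columns` and the hand-made sample rows are illustrative assumptions, not part of the report or of ydata-profiling.

```python
def uninformative_columns(rows):
    """Return names of columns that are constant or entirely missing.

    `rows` is a list of dicts (one per record); None marks a missing value.
    """
    if not rows:
        return []
    flagged = []
    for name in rows[0]:
        values = {row[name] for row in rows}
        # A single distinct value covers both cases: one repeated value
        # (constant) or nothing but None (100% missing).
        if len(values) == 1:
            flagged.append(name)
    return flagged


# Tiny hand-made sample that mimics the alert pattern; not the real data.
sample = [
    {"순번": 7725126, "라벨명": "주차장", "조서ID": "", "X좌표": None},
    {"순번": 7700149, "라벨명": "노인여가", "조서ID": "", "X좌표": None},
]
print(uninformative_columns(sample))  # ['조서ID', 'X좌표']
```

On the full dataset the same rule would be expected to flag 조서ID, 고시ID, 고시일자, X좌표 and Y좌표, matching the alerts.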

Reproduction

Analysis started: 2024-05-11 09:41:53.160522
Analysis finished: 2024-05-11 09:41:55.467424
Duration: 2.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
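
A report of this kind can be regenerated with ydata-profiling. The sketch below is a hedged illustration only: it assumes the dataset has been exported as a CSV file, the paths and the title are hypothetical, and it uses default settings rather than the original config.json, so the output may differ in detail from this report.

```python
def build_profile(csv_path, html_path):
    """Profile a CSV file and write the HTML report to `html_path`."""
    # Imported lazily so the function can be defined without the
    # third-party packages (pandas, ydata-profiling) installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings; the configuration of the original run is not
    # reproduced here.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report
```

A call such as `build_profile("data.csv", "report.html")` would write the report next to the input file; both file names are placeholders.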

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7711595.6
Minimum: 7661530
Maximum: 7762170
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 7661530
5-th percentile: 7666718.5
Q1: 7686289.5
median: 7711689
Q3: 7736521.8
95-th percentile: 7756952.3
Maximum: 7762170
Range: 100640
Interquartile range (IQR): 50232.25

Descriptive statistics

Standard deviation: 28944.019
Coefficient of variation (CV): 0.0037533113
Kurtosis: -1.2018814
Mean: 7711595.6
Median Absolute Deviation (MAD): 25109
Skewness: 0.0085184355
Sum: 7.7115956 × 10^10
Variance: 8.3775624 × 10^8
Monotonicity: Not monotonic
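
Several of the statistics for 순번 are derived from one another, so they can be checked against each other. The sketch below recomputes the coefficient of variation, variance, range and IQR from the reported values; small differences are expected because the report shows rounded inputs.

```python
# Values copied from the 순번 statistics tables.
mean = 7711595.6
std = 28944.019
minimum, maximum = 7661530, 7762170
q1, q3 = 7686289.5, 7736521.8

cv = std / mean            # coefficient of variation
variance = std ** 2        # variance is the squared standard deviation
value_range = maximum - minimum
iqr = q3 - q1              # interquartile range

print(f"CV       = {cv:.10f}")
print(f"Variance = {variance:.7e}")
print(f"Range    = {value_range}")
print(f"IQR      = {iqr:.2f}")
```

The kurtosis of about -1.2 and the near-zero skewness are what a roughly uniform spread of values between the minimum and maximum would produce, since a uniform distribution has an excess kurtosis of -1.2.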
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
7725126 | 1 | < 0.1%
7664731 | 1 | < 0.1%
7701028 | 1 | < 0.1%
7739241 | 1 | < 0.1%
7754656 | 1 | < 0.1%
7711958 | 1 | < 0.1%
7743731 | 1 | < 0.1%
7708380 | 1 | < 0.1%
7691694 | 1 | < 0.1%
7707510 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Value | Count | Frequency (%)
7661530 | 1 | < 0.1%
7661532 | 1 | < 0.1%
7661533 | 1 | < 0.1%
7661784 | 1 | < 0.1%
7661791 | 1 | < 0.1%
7661798 | 1 | < 0.1%
7661804 | 1 | < 0.1%
7661827 | 1 | < 0.1%
7661846 | 1 | < 0.1%
7661857 | 1 | < 0.1%

Value | Count | Frequency (%)
7762170 | 1 | < 0.1%
7762166 | 1 | < 0.1%
7762138 | 1 | < 0.1%
7762123 | 1 | < 0.1%
7762118 | 1 | < 0.1%
7762116 | 1 | < 0.1%
7762115 | 1 | < 0.1%
7762107 | 1 | < 0.1%
7762102 | 1 | < 0.1%
7762091 | 1 | < 0.1%

ID
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 19
Median length: 19
Mean length: 18.9999
Min length: 18

Characters and Unicode

Total characters: 189999
Distinct characters: 22
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
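
As a small illustration of the character properties mentioned above, Python's standard `unicodedata` module reports the general category of each code point. The sketch below counts the categories in the first sample ID of the report; it covers categories only, since scripts and blocks are not exposed by the standard library.

```python
import unicodedata
from collections import Counter

# First sample ID from the report.
sample_id = "생활서비스시설_소외지역_062342"

# Map each character to its Unicode general category (e.g. "Lo", "Nd", "Pc").
categories = Counter(unicodedata.category(ch) for ch in sample_id)
print(dict(categories))  # {'Lo': 11, 'Pc': 2, 'Nd': 6}
```

Here `Lo` is Other Letter (the Hangul syllables), `Pc` is Connector Punctuation (the underscores) and `Nd` is Decimal Number (the digits), which are the three categories the report lists.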

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: 생활서비스시설_소외지역_062342
2nd row: 생활서비스시설_소외지역_063818
3rd row: 생활서비스시설_소외지역_040356
4th row: 생활서비스시설_소외지역_033519
5th row: 생활서비스시설_소외지역_004593

Value | Count | Frequency (%)
생활서비스시설_소외지역_062342 | 1 | < 0.1%
생활서비스시설_소외지역_044217 | 1 | < 0.1%
생활서비스시설_소외지역_004160 | 1 | < 0.1%
생활서비스시설_소외지역_054447 | 1 | < 0.1%
생활서비스시설_소외지역_038790 | 1 | < 0.1%
생활서비스시설_소외지역_076422 | 1 | < 0.1%
생활서비스시설_소외지역_094878 | 1 | < 0.1%
생활서비스시설_소외지역_050982 | 1 | < 0.1%
생활서비스시설_소외지역_075571 | 1 | < 0.1%
생활서비스시설_소외지역_002616 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
_ | 20000 | 10.5%
0 | 14979 | 7.9%
 | 10000 | 5.3%
 | 10000 | 5.3%
 | 10000 | 5.3%
 | 10000 | 5.3%
 | 10000 | 5.3%
 | 10000 | 5.3%
 | 10000 | 5.3%
 | 10000 | 5.3%
Other values (12) | 75020 | 39.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 110000 | 57.9%
Decimal Number | 59999 | 31.6%
Connector Punctuation | 20000 | 10.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%

Decimal Number
Value | Count | Frequency (%)
0 | 14979 | 25.0%
1 | 5297 | 8.8%
3 | 5100 | 8.5%
5 | 5097 | 8.5%
2 | 5067 | 8.4%
7 | 4929 | 8.2%
4 | 4927 | 8.2%
6 | 4891 | 8.2%
8 | 4890 | 8.2%
9 | 4822 | 8.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 20000 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 110000 | 57.9%
Common | 79999 | 42.1%

Most frequent character per script

Common
Value | Count | Frequency (%)
_ | 20000 | 25.0%
0 | 14979 | 18.7%
1 | 5297 | 6.6%
3 | 5100 | 6.4%
5 | 5097 | 6.4%
2 | 5067 | 6.3%
7 | 4929 | 6.2%
4 | 4927 | 6.2%
6 | 4891 | 6.1%
8 | 4890 | 6.1%

Hangul
Value | Count | Frequency (%)
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 110000 | 57.9%
ASCII | 79999 | 42.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
_ | 20000 | 25.0%
0 | 14979 | 18.7%
1 | 5297 | 6.6%
3 | 5100 | 6.4%
5 | 5097 | 6.4%
2 | 5067 | 6.3%
7 | 4929 | 6.2%
4 | 4927 | 6.2%
6 | 4891 | 6.1%
8 | 4890 | 6.1%

Hangul
Value | Count | Frequency (%)
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%
 | 10000 | 9.1%

도시계획코드
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ZON320
2nd row: ZON320
3rd row: ZON314
4th row: ZON314
5th row: ZON314

Common Values

Value | Count | Frequency (%)
ZON314 | 4638 | 46.4%
ZON312 | 2375 | 23.8%
ZON320 | 2061 | 20.6%
ZON322 | 627 | 6.3%
ZON318 | 198 | 2.0%
ZON316 | 101 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
zon314 | 4638 | 46.4%
zon312 | 2375 | 23.8%
zon320 | 2061 | 20.6%
zon322 | 627 | 6.3%
zon318 | 198 | 2.0%
zon316 | 101 | 1.0%

분류명
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 9
Mean length: 8.3715
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 소외지역_주차장
2nd row: 소외지역_주차장
3rd row: 소외지역_노인여가
4th row: 소외지역_노인여가
5th row: 소외지역_노인여가

Common Values

Value | Count | Frequency (%)
소외지역_노인여가 | 4638 | 46.4%
소외지역_공원 | 2375 | 23.8%
소외지역_주차장 | 2061 | 20.6%
소외지역_청소년아동 | 627 | 6.3%
소외지역_어린이집 | 198 | 2.0%
소외지역_도서관 | 101 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
소외지역_노인여가 | 4638 | 46.4%
소외지역_공원 | 2375 | 23.8%
소외지역_주차장 | 2061 | 20.6%
소외지역_청소년아동 | 627 | 6.3%
소외지역_어린이집 | 198 | 2.0%
소외지역_도서관 | 101 | 1.0%

조서ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row:
2nd row:
3rd row:
4th row:
5th row:

Common Values

Value | Count | Frequency (%)
"" | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
No values found.

고시ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row:
2nd row:
3rd row:
4th row:
5th row:

Common Values

Value | Count | Frequency (%)
"" | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
No values found.

라벨명
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 5
Median length: 4
Mean length: 3.3715
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 주차장
2nd row: 주차장
3rd row: 노인여가
4th row: 노인여가
5th row: 노인여가

Common Values

Value | Count | Frequency (%)
노인여가 | 4638 | 46.4%
공원 | 2375 | 23.8%
주차장 | 2061 | 20.6%
청소년아동 | 627 | 6.3%
어린이집 | 198 | 2.0%
도서관 | 101 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
노인여가 | 4638 | 46.4%
공원 | 2375 | 23.8%
주차장 | 2061 | 20.6%
청소년아동 | 627 | 6.3%
어린이집 | 198 | 2.0%
도서관 | 101 | 1.0%

고시일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row:
2nd row:
3rd row:
4th row:
5th row:

Common Values

Value | Count | Frequency (%)
"" | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
No values found.

X좌표
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

Y좌표
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

Interactions


Correlations

Variable | 순번 | 도시계획코드 | 분류명 | 라벨명
순번 | 1.000 | 0.862 | 0.862 | 0.862
도시계획코드 | 0.862 | 1.000 | 1.000 | 1.000
분류명 | 0.862 | 1.000 | 1.000 | 1.000
라벨명 | 0.862 | 1.000 | 1.000 | 1.000

Variable | 라벨명 | 분류명 | 도시계획코드
라벨명 | 1.000 | 1.000 | 1.000
분류명 | 1.000 | 1.000 | 1.000
도시계획코드 | 1.000 | 1.000 | 1.000

Variable | 순번 | 도시계획코드 | 분류명 | 라벨명
순번 | 1.000 | 0.685 | 0.685 | 0.685
도시계획코드 | 0.685 | 1.000 | 1.000 | 1.000
분류명 | 0.685 | 1.000 | 1.000 | 1.000
라벨명 | 0.685 | 1.000 | 1.000 | 1.000
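
The 1.000 entries among 도시계획코드, 분류명 and 라벨명 are what a one-to-one relabelling of the same six categories would produce. The extracted text does not say which association measure each matrix uses, so the sketch below uses plain (uncorrected) Cramér's V as a stand-in. It assumes, based on the identical category counts and the sample rows, that each 도시계획코드 value pairs with exactly one 라벨명 value; the function name `cramers_v` is illustrative.

```python
import math


def cramers_v(table):
    """Plain (uncorrected) Cramér's V for a contingency table (list of rows)."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals))
    return math.sqrt(chi2 / (n * (k - 1)))


# Category counts from the "Common Values" tables. Assumption: each
# 도시계획코드 value pairs with exactly one 라벨명 value, so the contingency
# table is diagonal.
counts = [4638, 2375, 2061, 627, 198, 101]
diagonal = [[c if i == j else 0 for j in range(len(counts))]
            for i, c in enumerate(counts)]
print(round(cramers_v(diagonal), 3))  # 1.0
```

Because the three columns encode the same information, keeping only one of them would lose nothing for most downstream analysis.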

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 순번 | ID | 도시계획코드 | 분류명 | 조서ID | 고시ID | 라벨명 | 고시일자 | X좌표 | Y좌표
62303 | 7725126 | 생활서비스시설_소외지역_062342 | ZON320 | 소외지역_주차장 |  |  | 주차장 |  | <NA> | <NA>
64824 | 7725604 | 생활서비스시설_소외지역_063818 | ZON320 | 소외지역_주차장 |  |  | 주차장 |  | <NA> | <NA>
38848 | 7700149 | 생활서비스시설_소외지역_040356 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
36230 | 7697698 | 생활서비스시설_소외지역_033519 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
4680 | 7666929 | 생활서비스시설_소외지역_004593 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
86269 | 7747220 | 생활서비스시설_소외지역_084180 | ZON312 | 소외지역_공원 |  |  | 공원 |  | <NA> | <NA>
29186 | 7689552 | 생활서비스시설_소외지역_029318 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
26538 | 7687527 | 생활서비스시설_소외지역_024510 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
70540 | 7732111 | 생활서비스시설_소외지역_071570 | ZON322 | 소외지역_청소년아동 |  |  | 청소년아동 |  | <NA> | <NA>
43530 | 7704600 | 생활서비스시설_소외지역_043305 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>

(index) | 순번 | ID | 도시계획코드 | 분류명 | 조서ID | 고시ID | 라벨명 | 고시일자 | X좌표 | Y좌표
33942 | 7695991 | 생활서비스시설_소외지역_027572 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
74579 | 7736395 | 생활서비스시설_소외지역_074872 | ZON322 | 소외지역_청소년아동 |  |  | 청소년아동 |  | <NA> | <NA>
37911 | 7699887 | 생활서비스시설_소외지역_037330 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
69683 | 7731778 | 생활서비스시설_소외지역_072306 | ZON322 | 소외지역_청소년아동 |  |  | 청소년아동 |  | <NA> | <NA>
33758 | 7696669 | 생활서비스시설_소외지역_034370 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
23797 | 7685909 | 생활서비스시설_소외지역_024410 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
98387 | 7759055 | 생활서비스시설_소외지역_095756 | ZON312 | 소외지역_공원 |  |  | 공원 |  | <NA> | <NA>
22321 | 7684066 | 생활서비스시설_소외지역_022932 | ZON314 | 소외지역_노인여가 |  |  | 노인여가 |  | <NA> | <NA>
69459 | 7731721 | 생활서비스시설_소외지역_069395 | ZON320 | 소외지역_주차장 |  |  | 주차장 |  | <NA> | <NA>
91215 | 7754118 | 생활서비스시설_소외지역_092776 | ZON312 | 소외지역_공원 |  |  | 공원 |  | <NA> | <NA>