Overview

Dataset statistics

Number of variables: 10
Number of observations: 1888
Missing cells: 3776
Missing cells (%): 20.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 153.2 KiB
Average record size in memory: 83.1 B
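
The derived figures above can be checked by hand. The sketch below is only an arithmetic cross-check, assuming (as the Alerts section reports) that the missing cells come entirely from the two fully missing columns X좌표 and Y좌표; the variable names are illustrative and not part of the report.

```python
# Figures copied from the report overview.
n_variables = 10
n_observations = 1888
fully_missing_columns = 2  # assumption: X좌표 and Y좌표, both 100% missing

total_cells = n_variables * n_observations
missing_cells = fully_missing_columns * n_observations
missing_pct = 100 * missing_cells / total_cells

total_kib = 153.2  # "Total size in memory"
avg_record_bytes = total_kib * 1024 / n_observations

print(missing_cells)
print(f"{missing_pct:.1f}%")
print(f"{avg_record_bytes:.1f} B")
```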

Variable types

Numeric: 1
Text: 2
Categorical: 5
Unsupported: 2

Dataset

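Description: 순번 (sequence number), ID, 도시계획코드 (urban planning code), 분류명 (category name), 조서ID (record ID), 고시ID (public notice ID), 라벨명 (label name), 고시일자 (notice date), X좌표 (X coordinate), Y좌표 (Y coordinate)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15529/S/1/datasetView.do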

Alerts

도시계획코드 has constant value "ZON216" [Constant]
분류명 has constant value "생활서비스시설_공원" [Constant]
조서ID has a constant blank value (length 1) [Constant]
고시ID has a constant blank value (length 1) [Constant]
고시일자 has a constant blank value (length 1) [Constant]
X좌표 has 1888 (100.0%) missing values [Missing]
Y좌표 has 1888 (100.0%) missing values [Missing]
순번 has unique values [Unique]
ID has unique values [Unique]
X좌표 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Y좌표 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
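
The Constant, Missing, and Unique alerts can be approximated with a few lines of plain Python. This is a minimal sketch, not the profiler's own logic: `column_alerts` is a hypothetical helper, the rows are a three-row, five-column subset of the sample shown later in this report, and the blank 조서ID value is assumed to be a single space because the report lists its length as 1. The Unsupported alert is not modeled.

```python
def column_alerts(rows):
    """Return {column: [alert, ...]} for a list of row dicts; None marks a missing cell."""
    n = len(rows)
    alerts = {}
    for col in rows[0]:
        present = [row[col] for row in rows if row[col] is not None]
        distinct = set(present)
        tags = []
        if not present:
            tags.append("Missing")      # every cell is missing
        elif len(distinct) == 1:
            tags.append("Constant")     # a single repeated value
        elif len(distinct) == n:
            tags.append("Unique")       # no repeats and no gaps
        alerts[col] = tags
    return alerts


# Subset of the report's sample rows; " " for 조서ID is an assumption.
rows = [
    {"순번": 67334, "ID": "생활서비스시설_공원_0966", "도시계획코드": "ZON216", "조서ID": " ", "X좌표": None},
    {"순번": 67335, "ID": "생활서비스시설_공원_0967", "도시계획코드": "ZON216", "조서ID": " ", "X좌표": None},
    {"순번": 67336, "ID": "생활서비스시설_공원_0968", "도시계획코드": "ZON216", "조서ID": " ", "X좌표": None},
]

alerts = column_alerts(rows)
for column, tags in alerts.items():
    print(column, tags)
```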

Reproduction

Analysis started: 2024-05-11 08:40:50.260550
Analysis finished: 2024-05-11 08:40:52.738044
Duration: 2.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 1888
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68144.5
Minimum: 67201
Maximum: 69088
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.7 KiB

Quantile statistics

Minimum: 67201
5-th percentile: 67295.35
Q1: 67672.75
Median: 68144.5
Q3: 68616.25
95-th percentile: 68993.65
Maximum: 69088
Range: 1887
Interquartile range (IQR): 943.5
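
These quantiles are what linear interpolation gives for a run of consecutive integers. The sketch below assumes 순번 holds every integer from 67201 to 69088 (inferred from 1888 distinct values over a range of 1887), and `quantile` is an illustrative helper rather than the profiler's implementation.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# Assumption: 순번 is the full run of consecutive integers.
values = list(range(67201, 69089))

p05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

for name, value in [("5%", p05), ("Q1", q1), ("median", median),
                    ("Q3", q3), ("95%", p95), ("IQR", iqr)]:
    print(f"{name}: {value:.2f}")
```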

Descriptive statistics

Standard deviation: 545.16297
Coefficient of variation (CV): 0.0080001023
Kurtosis: -1.2
Mean: 68144.5
Median Absolute Deviation (MAD): 472
Skewness: 0
Sum: 1.2865682 × 10^8
Variance: 297202.67
Monotonicity: Not monotonic
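
The descriptive statistics can be reproduced with the standard library under the same assumption that 순번 is the consecutive run 67201 to 69088. The kurtosis here uses the plain population formula, which is an assumption and may differ slightly from the estimator the profiler applies, although both round to -1.2.

```python
import math
import statistics

# Assumption: 순번 is every integer from 67201 to 69088.
values = list(range(67201, 69089))
n = len(values)

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std = math.sqrt(variance)
cv = std / mean                           # coefficient of variation

median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Population excess kurtosis: fourth central moment over squared variance, minus 3.
m2 = sum((v - mean) ** 2 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n
excess_kurtosis = m4 / m2 ** 2 - 3

print(total, mean, round(variance, 2), round(std, 5), mad, round(excess_kurtosis, 1))
```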
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
67334 | 1 | 0.1%
68312 | 1 | 0.1%
68732 | 1 | 0.1%
68731 | 1 | 0.1%
68730 | 1 | 0.1%
68729 | 1 | 0.1%
68728 | 1 | 0.1%
68727 | 1 | 0.1%
68726 | 1 | 0.1%
68725 | 1 | 0.1%
Other values (1878) | 1878 | 99.5%

Minimum 10 values
Value | Count | Frequency (%)
67201 | 1 | 0.1%
67202 | 1 | 0.1%
67203 | 1 | 0.1%
67204 | 1 | 0.1%
67205 | 1 | 0.1%
67206 | 1 | 0.1%
67207 | 1 | 0.1%
67208 | 1 | 0.1%
67209 | 1 | 0.1%
67210 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
69088 | 1 | 0.1%
69087 | 1 | 0.1%
69086 | 1 | 0.1%
69085 | 1 | 0.1%
69084 | 1 | 0.1%
69083 | 1 | 0.1%
69082 | 1 | 0.1%
69081 | 1 | 0.1%
69080 | 1 | 0.1%
69079 | 1 | 0.1%

ID
Text

UNIQUE 

Distinct: 1888
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB

Length

Max length: 15
Median length: 15
Mean length: 15
Min length: 15

Characters and Unicode

Total characters: 28320
Distinct characters: 20
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
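
As a small illustration of those character properties, the sketch below classifies the characters of one ID taken from this report with Python's standard `unicodedata` module. Because every ID has length 15 and the column totals are exact multiples of 1888, the per-ID shares should equal the column-wide shares reported for this variable; the Hangul count by character name is only a rough stand-in for the script and block tables.

```python
import unicodedata
from collections import Counter

sample_id = "생활서비스시설_공원_0966"  # first sample row of the ID column

# General category per character: Lo = Other Letter, Nd = Decimal Number,
# Pc = Connector Punctuation.
categories = Counter(unicodedata.category(ch) for ch in sample_id)
shares = {cat: round(100 * count / len(sample_id), 1)
          for cat, count in categories.items()}

# Rough script check based on the Unicode character name.
hangul_count = sum(1 for ch in sample_id
                   if unicodedata.name(ch).startswith("HANGUL"))

print(dict(categories))
print(shares)
print(hangul_count)
```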

Unique

Unique: 1888
Unique (%): 100.0%

Sample

1st row: 생활서비스시설_공원_0966
2nd row: 생활서비스시설_공원_0967
3rd row: 생활서비스시설_공원_0968
4th row: 생활서비스시설_공원_0969
5th row: 생활서비스시설_공원_0971

Value | Count | Frequency (%)
생활서비스시설_공원_0966 | 1 | 0.1%
생활서비스시설_공원_1277 | 1 | 0.1%
생활서비스시설_공원_0127 | 1 | 0.1%
생활서비스시설_공원_0125 | 1 | 0.1%
생활서비스시설_공원_0124 | 1 | 0.1%
생활서비스시설_공원_0122 | 1 | 0.1%
생활서비스시설_공원_0120 | 1 | 0.1%
생활서비스시설_공원_0118 | 1 | 0.1%
생활서비스시설_공원_0116 | 1 | 0.1%
생활서비스시설_공원_1294 | 1 | 0.1%
Other values (1878) | 1878 | 99.5%

Most occurring characters

Value | Count | Frequency (%)
_ | 3776 | 13.3%
생 | 1888 | 6.7%
활 | 1888 | 6.7%
서 | 1888 | 6.7%
비 | 1888 | 6.7%
스 | 1888 | 6.7%
시 | 1888 | 6.7%
설 | 1888 | 6.7%
공 | 1888 | 6.7%
원 | 1888 | 6.7%
Other values (10) | 7552 | 26.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16992 | 60.0%
Decimal Number | 7552 | 26.7%
Connector Punctuation | 3776 | 13.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1575 | 20.9%
1 | 1468 | 19.4%
6 | 579 | 7.7%
7 | 579 | 7.7%
4 | 579 | 7.7%
5 | 579 | 7.7%
2 | 579 | 7.7%
3 | 579 | 7.7%
8 | 567 | 7.5%
9 | 468 | 6.2%

Other Letter
Value | Count | Frequency (%)
생 | 1888 | 11.1%
활 | 1888 | 11.1%
서 | 1888 | 11.1%
비 | 1888 | 11.1%
스 | 1888 | 11.1%
시 | 1888 | 11.1%
설 | 1888 | 11.1%
공 | 1888 | 11.1%
원 | 1888 | 11.1%

Connector Punctuation
Value | Count | Frequency (%)
_ | 3776 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16992 | 60.0%
Common | 11328 | 40.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
_ | 3776 | 33.3%
0 | 1575 | 13.9%
1 | 1468 | 13.0%
6 | 579 | 5.1%
7 | 579 | 5.1%
4 | 579 | 5.1%
5 | 579 | 5.1%
2 | 579 | 5.1%
3 | 579 | 5.1%
8 | 567 | 5.0%

Hangul
Value | Count | Frequency (%)
생 | 1888 | 11.1%
활 | 1888 | 11.1%
서 | 1888 | 11.1%
비 | 1888 | 11.1%
스 | 1888 | 11.1%
시 | 1888 | 11.1%
설 | 1888 | 11.1%
공 | 1888 | 11.1%
원 | 1888 | 11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16992 | 60.0%
ASCII | 11328 | 40.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
_ | 3776 | 33.3%
0 | 1575 | 13.9%
1 | 1468 | 13.0%
6 | 579 | 5.1%
7 | 579 | 5.1%
4 | 579 | 5.1%
5 | 579 | 5.1%
2 | 579 | 5.1%
3 | 579 | 5.1%
8 | 567 | 5.0%

Hangul
Value | Count | Frequency (%)
생 | 1888 | 11.1%
활 | 1888 | 11.1%
서 | 1888 | 11.1%
비 | 1888 | 11.1%
스 | 1888 | 11.1%
시 | 1888 | 11.1%
설 | 1888 | 11.1%
공 | 1888 | 11.1%
원 | 1888 | 11.1%
11.1%

도시계획코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
ZON216: 1888

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ZON216
2nd row: ZON216
3rd row: ZON216
4th row: ZON216
5th row: ZON216

Common Values

Value | Count | Frequency (%)
ZON216 | 1888 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
zon216 | 1888 | 100.0%

분류명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
생활서비스시설_공원: 1888

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 생활서비스시설_공원
2nd row: 생활서비스시설_공원
3rd row: 생활서비스시설_공원
4th row: 생활서비스시설_공원
5th row: 생활서비스시설_공원

Common Values

Value | Count | Frequency (%)
생활서비스시설_공원 | 1888 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
생활서비스시설_공원 | 1888 | 100.0%

조서ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
(blank): 1888

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: (blank)
3rd row: (blank)
4th row: (blank)
5th row: (blank)

Common Values

Value | Count | Frequency (%)
(blank) | 1888 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
No values found.

고시ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
(blank): 1888

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: (blank)
3rd row: (blank)
4th row: (blank)
5th row: (blank)

Common Values

Value | Count | Frequency (%)
(blank) | 1888 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
No values found.

라벨명
Text

Distinct: 1584
Distinct (%): 83.9%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB

Length

Max length: 21
Median length: 20
Mean length: 10.019068
Min length: 7

Characters and Unicode

Total characters: 18916
Distinct characters: 435
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1414
Unique (%): 74.9%

Sample

1st row: 어린이공원(꽃사슴)
2nd row: 어린이공원(목련)
3rd row: 어린이공원(신기)
4th row: 마을마당(신정3)
5th row: 어린이공원(동개울)

Value | Count | Frequency (%)
어린이공원(개나리 | 11 | 0.6%
어린이공원(장미 | 10 | 0.5%
어린이공원(샛별 | 10 | 0.5%
어린이공원(새싹 | 9 | 0.5%
어린이공원(까치 | 9 | 0.5%
어린이공원(동산 | 8 | 0.4%
어린이공원(무궁화 | 8 | 0.4%
어린이공원(진달래 | 7 | 0.4%
어린이공원(무지개 | 7 | 0.4%
어린이공원(꿈나무 | 6 | 0.3%
Other values (1581) | 1817 | 95.5%

Most occurring characters

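Value | Count | Frequency (%)
) | 1943 | 10.3%
( | 1942 | 10.3%
(character lost) | 1737 | 9.2%
(character lost) | 1702 | 9.0%
(character lost) | 1344 | 7.1%
(character lost) | 1122 | 5.9%
(character lost) | 1072 | 5.7%
(character lost) | 739 | 3.9%
(character lost) | 376 | 2.0%
(character lost) | 360 | 1.9%
Other values (425) | 6579 | 34.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 14553 | 76.9%
Close Punctuation | 1943 | 10.3%
Open Punctuation | 1942 | 10.3%
Decimal Number | 294 | 1.6%
Math Symbol | 164 | 0.9%
Space Separator | 14 | 0.1%
Dash Punctuation | 4 | < 0.1%
Other Punctuation | 2 | < 0.1%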

Most frequent character per category

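Other Letter
Value | Count | Frequency (%)
(character lost) | 1737 | 11.9%
(character lost) | 1702 | 11.7%
(character lost) | 1344 | 9.2%
(character lost) | 1122 | 7.7%
(character lost) | 1072 | 7.4%
(character lost) | 739 | 5.1%
(character lost) | 376 | 2.6%
(character lost) | 360 | 2.5%
(character lost) | 333 | 2.3%
(character lost) | 267 | 1.8%
Other values (408) | 5501 | 37.8%

Decimal Number
Value | Count | Frequency (%)
1 | 99 | 33.7%
2 | 94 | 32.0%
3 | 49 | 16.7%
4 | 18 | 6.1%
6 | 11 | 3.7%
7 | 11 | 3.7%
5 | 9 | 3.1%
8 | 2 | 0.7%
0 | 1 | 0.3%

Math Symbol
Value | Count | Frequency (%)
> | 82 | 50.0%
< | 82 | 50.0%

Other Punctuation
Value | Count | Frequency (%)
? | 1 | 50.0%
. | 1 | 50.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1943 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1942 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 14 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%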

Most occurring scripts

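Value | Count | Frequency (%)
Hangul | 14553 | 76.9%
Common | 4363 | 23.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(character lost) | 1737 | 11.9%
(character lost) | 1702 | 11.7%
(character lost) | 1344 | 9.2%
(character lost) | 1122 | 7.7%
(character lost) | 1072 | 7.4%
(character lost) | 739 | 5.1%
(character lost) | 376 | 2.6%
(character lost) | 360 | 2.5%
(character lost) | 333 | 2.3%
(character lost) | 267 | 1.8%
Other values (408) | 5501 | 37.8%

Common
Value | Count | Frequency (%)
) | 1943 | 44.5%
( | 1942 | 44.5%
1 | 99 | 2.3%
2 | 94 | 2.2%
> | 82 | 1.9%
< | 82 | 1.9%
3 | 49 | 1.1%
4 | 18 | 0.4%
(space) | 14 | 0.3%
6 | 11 | 0.3%
Other values (7) | 29 | 0.7%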

Most occurring blocks

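Value | Count | Frequency (%)
Hangul | 14553 | 76.9%
ASCII | 4363 | 23.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
) | 1943 | 44.5%
( | 1942 | 44.5%
1 | 99 | 2.3%
2 | 94 | 2.2%
> | 82 | 1.9%
< | 82 | 1.9%
3 | 49 | 1.1%
4 | 18 | 0.4%
(space) | 14 | 0.3%
6 | 11 | 0.3%
Other values (7) | 29 | 0.7%

Hangul
Value | Count | Frequency (%)
(character lost) | 1737 | 11.9%
(character lost) | 1702 | 11.7%
(character lost) | 1344 | 9.2%
(character lost) | 1122 | 7.7%
(character lost) | 1072 | 7.4%
(character lost) | 739 | 5.1%
(character lost) | 376 | 2.6%
(character lost) | 360 | 2.5%
(character lost) | 333 | 2.3%
(character lost) | 267 | 1.8%
Other values (408) | 5501 | 37.8%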

고시일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
(blank): 1888

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: (blank)
3rd row: (blank)
4th row: (blank)
5th row: (blank)

Common Values

Value | Count | Frequency (%)
(blank) | 1888 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
No values found.

X좌표
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 1888
Missing (%): 100.0%
Memory size: 16.7 KiB

Y좌표
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 1888
Missing (%): 100.0%
Memory size: 16.7 KiB

Interactions


Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

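First rows
Index | 순번 | ID | 도시계획코드 | 분류명 | 조서ID | 고시ID | 라벨명 | 고시일자 | X좌표 | Y좌표
0 | 67334 | 생활서비스시설_공원_0966 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(꽃사슴) | | <NA> | <NA>
1 | 67335 | 생활서비스시설_공원_0967 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(목련) | | <NA> | <NA>
2 | 67336 | 생활서비스시설_공원_0968 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(신기) | | <NA> | <NA>
3 | 67337 | 생활서비스시설_공원_0969 | ZON216 | 생활서비스시설_공원 | | | 마을마당(신정3) | | <NA> | <NA>
4 | 67338 | 생활서비스시설_공원_0971 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(동개울) | | <NA> | <NA>
5 | 67339 | 생활서비스시설_공원_0974 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(방아다리) | | <NA> | <NA>
6 | 67340 | 생활서비스시설_공원_0975 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(별님) | | <NA> | <NA>
7 | 67341 | 생활서비스시설_공원_0977 | ZON216 | 생활서비스시설_공원 | | | 근린공원(오솔길) | | <NA> | <NA>
8 | 67342 | 생활서비스시설_공원_0979 | ZON216 | 생활서비스시설_공원 | | | 근린공원(한울) | | <NA> | <NA>
9 | 67343 | 생활서비스시설_공원_0980 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(꿀벌) | | <NA> | <NA>

Last rows
Index | 순번 | ID | 도시계획코드 | 분류명 | 조서ID | 고시ID | 라벨명 | 고시일자 | X좌표 | Y좌표
1878 | 67992 | 생활서비스시설_공원_1046 | ZON216 | 생활서비스시설_공원 | | | 근린공원(서낭당) | | <NA> | <NA>
1879 | 67993 | 생활서비스시설_공원_1050 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(능말) | | <NA> | <NA>
1880 | 67994 | 생활서비스시설_공원_1054 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(초롱) | | <NA> | <NA>
1881 | 67995 | 생활서비스시설_공원_1057 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(새싹) | | <NA> | <NA>
1882 | 67996 | 생활서비스시설_공원_1059 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(부석) | | <NA> | <NA>
1883 | 67997 | 생활서비스시설_공원_1062 | ZON216 | 생활서비스시설_공원 | | | 체육공원(마곡(미조성)) | | <NA> | <NA>
1884 | 67998 | 생활서비스시설_공원_1065 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(쌈지) | | <NA> | <NA>
1885 | 67999 | 생활서비스시설_공원_1070 | ZON216 | 생활서비스시설_공원 | | | 기타공원(참새) | | <NA> | <NA>
1886 | 68000 | 생활서비스시설_공원_1073 | ZON216 | 생활서비스시설_공원 | | | 어린이공원(까치산) | | <NA> | <NA>
1887 | 68001 | 생활서비스시설_공원_1077 | ZON216 | 생활서비스시설_공원 | | | 근린공원(우장<시공원>) | | <NA> | <NA>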
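
The sampled 라벨명 values appear to follow a `type(name)` pattern, for example 어린이공원(꽃사슴). The sketch below splits a label at its first opening parenthesis so that nested names such as 마곡(미조성) stay intact. `split_label` is a hypothetical helper, the pattern is inferred only from the sample rows, and the unequal counts of "(" and ")" in the character tables suggest that not every label follows it.

```python
from collections import Counter


def split_label(label):
    """Split 'type(name)' into (type, name); return (label, None) if it does not fit."""
    head, sep, rest = label.partition("(")
    if not sep or not rest.endswith(")"):
        return label, None
    return head, rest[:-1]  # drop only the final closing parenthesis


# 라벨명 values from the first ten sample rows of this report.
labels = [
    "어린이공원(꽃사슴)", "어린이공원(목련)", "어린이공원(신기)", "마을마당(신정3)",
    "어린이공원(동개울)", "어린이공원(방아다리)", "어린이공원(별님)",
    "근린공원(오솔길)", "근린공원(한울)", "어린이공원(꿀벌)",
]

type_counts = Counter(split_label(label)[0] for label in labels)
print(type_counts.most_common())
print(split_label("체육공원(마곡(미조성))"))
```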