Overview

Dataset statistics

Number of variables: 8
Number of observations: 37
Missing cells: 74
Missing cells (%): 25.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.5 KiB
Average record size in memory: 68.5 B
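
The missing-cell figures follow from the table shape and the per-column missing counts listed under Alerts (36, 36, and 2). A minimal sketch of that arithmetic; the dictionary and variable names are illustrative, not part of ydata-profiling, and the other five columns are taken to have no missing values, as their per-variable summaries report:

```python
# Per-column missing counts as reported in this profile.
# The dict name and layout are illustrative only.
missing_per_column = {
    "테이블영문명": 36,
    "테이블한글명": 36,
    "길이": 2,
}

n_variables = 8
n_observations = 37

total_cells = n_variables * n_observations        # 8 * 37 = 296
missing_cells = sum(missing_per_column.values())  # 36 + 36 + 2 = 74
missing_pct = 100 * missing_cells / total_cells   # share of all cells

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.1f}%)")
```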

Variable types

Text: 4
Numeric: 1
Categorical: 1
Unsupported: 1
Boolean: 1

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-319/S/1/datasetView.do

Alerts

테이블영문명 has constant value "" [Constant]
테이블한글명 has constant value "" [Constant]
컬럼순서 is highly overall correlated with Null [High correlation]
Null is highly overall correlated with 컬럼순서 [High correlation]
테이블영문명 has 36 (97.3%) missing values [Missing]
테이블한글명 has 36 (97.3%) missing values [Missing]
길이 has 2 (5.4%) missing values [Missing]
컬럼순서 has unique values [Unique]
컬럼영문명 has unique values [Unique]
컬럼한글명 has unique values [Unique]
길이 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2024-04-20 21:33:26.668526
Analysis finished: 2024-04-20 21:33:28.265768
Duration: 1.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

테이블영문명 (table English name)
Text

CONSTANT, MISSING

Distinct: 1
Distinct (%): 100.0%
Missing: 36
Missing (%): 97.3%
Memory size: 424.0 B

Length

Max length: 22
Median length: 22
Mean length: 22
Min length: 22

Characters and Unicode

Total characters: 22
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
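
The character and category counts for this column can be checked with Python's standard `unicodedata` module. This is a sketch for illustration only, not the profiler's own implementation; the standard library exposes general categories but not scripts or blocks, so only the category breakdown is reproduced here.

```python
import unicodedata
from collections import Counter

# The single non-missing value of this column, from the Sample subsection.
value = "TN_PARK_ND_PVTLND_WDPT"

# unicodedata.category returns two-letter general-category codes:
# "Lu" = Uppercase Letter, "Pc" = Connector Punctuation.
category_counts = Counter(unicodedata.category(ch) for ch in value)
char_counts = Counter(value)

print(len(value), len(char_counts))  # total and distinct characters
print(dict(category_counts))         # characters per general category
print(char_counts.most_common(1))    # most frequent character
```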

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: TN_PARK_ND_PVTLND_WDPT

Value                   Count  Frequency (%)
tn_park_nd_pvtlnd_wdpt  1      100.0%

Most occurring characters

Value  Count  Frequency (%)
_      4      18.2%
T      3      13.6%
N      3      13.6%
P      3      13.6%
D      3      13.6%
A      1      4.5%
R      1      4.5%
K      1      4.5%
V      1      4.5%
L      1      4.5%

Most occurring categories

Value                  Count  Frequency (%)
Uppercase Letter       18     81.8%
Connector Punctuation  4      18.2%

Most frequent character per category

Uppercase Letter
Value  Count  Frequency (%)
T      3      16.7%
N      3      16.7%
P      3      16.7%
D      3      16.7%
A      1      5.6%
R      1      5.6%
K      1      5.6%
V      1      5.6%
L      1      5.6%
W      1      5.6%

Connector Punctuation
Value  Count  Frequency (%)
_      4      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   18     81.8%
Common  4      18.2%

Most frequent character per script

Latin
Value  Count  Frequency (%)
T      3      16.7%
N      3      16.7%
P      3      16.7%
D      3      16.7%
A      1      5.6%
R      1      5.6%
K      1      5.6%
V      1      5.6%
L      1      5.6%
W      1      5.6%

Common
Value  Count  Frequency (%)
_      4      100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  22     100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
_      4      18.2%
T      3      13.6%
N      3      13.6%
P      3      13.6%
D      3      13.6%
A      1      4.5%
R      1      4.5%
K      1      4.5%
V      1      4.5%
L      1      4.5%

테이블한글명 (table Korean name)
Text

CONSTANT, MISSING

Distinct: 1
Distinct (%): 100.0%
Missing: 36
Missing (%): 97.3%
Memory size: 424.0 B

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Characters and Unicode

Total characters: 8
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: 공원및사유지수목 (trees in parks and on private land)

Value             Count  Frequency (%)
공원및사유지수목  1      100.0%

Most occurring characters

Value  Count  Frequency (%)
공     1      12.5%
원     1      12.5%
및     1      12.5%
사     1      12.5%
유     1      12.5%
지     1      12.5%
수     1      12.5%
목     1      12.5%

Most occurring categories

Value         Count  Frequency (%)
Other Letter  8      100.0%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
공     1      12.5%
원     1      12.5%
및     1      12.5%
사     1      12.5%
유     1      12.5%
지     1      12.5%
수     1      12.5%
목     1      12.5%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  8      100.0%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
공     1      12.5%
원     1      12.5%
및     1      12.5%
사     1      12.5%
유     1      12.5%
지     1      12.5%
수     1      12.5%
목     1      12.5%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  8      100.0%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
공     1      12.5%
원     1      12.5%
및     1      12.5%
사     1      12.5%
유     1      12.5%
지     1      12.5%
수     1      12.5%
목     1      12.5%

컬럼순서 (column order)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19
Minimum: 1
Maximum: 37
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 461.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.8
Q1: 10
Median: 19
Q3: 28
95-th percentile: 35.2
Maximum: 37
Range: 36
Interquartile range (IQR): 18
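
These quantiles are what linear interpolation gives for the integers 1 through 37. That the column holds exactly those values is an inference from the reported minimum, maximum, distinct count, and sum (703), not something the report states outright. A minimal sketch using the standard library, with `percentile` as a hypothetical helper written for this illustration:

```python
import statistics

# Assumption: 컬럼순서 holds the integers 1..37 with no gaps.
values = list(range(1, 38))

def percentile(data, p):
    """Linear-interpolation percentile for p in [0, 1] (hypothetical helper)."""
    ordered = sorted(data)
    pos = (len(ordered) - 1) * p
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)

# "inclusive" treats the data as the whole population (min and max included).
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
p05 = percentile(values, 0.05)
p95 = percentile(values, 0.95)
iqr = q3 - q1
value_range = max(values) - min(values)

print(q1, median, q3, iqr, value_range)
print(round(p05, 1), round(p95, 1))
```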

Descriptive statistics

Standard deviation: 10.824355
Coefficient of variation (CV): 0.56970291
Kurtosis: -1.2
Mean: 19
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 703
Variance: 117.16667
Monotonicity: Strictly increasing
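
Most of the descriptive statistics can be reproduced by hand under the same assumption that the column holds the integers 1 through 37. The sketch below uses the sample variance (n - 1 denominator), which is the convention that matches the reported 117.16667; variable names are illustrative.

```python
import statistics

# Assumption: 컬럼순서 holds the integers 1..37 with no gaps.
values = list(range(1, 38))

total = sum(values)                      # sum of all values
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, round(variance, 5), round(std_dev, 6), round(cv, 8), mad)
```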
Histogram with fixed size bins (bins=37)

Common values

Value              Count  Frequency (%)
1                  1      2.7%
29                 1      2.7%
22                 1      2.7%
23                 1      2.7%
24                 1      2.7%
25                 1      2.7%
26                 1      2.7%
27                 1      2.7%
28                 1      2.7%
30                 1      2.7%
Other values (27)  27     73.0%

Minimum 10 values

Value  Count  Frequency (%)
1      1      2.7%
2      1      2.7%
3      1      2.7%
4      1      2.7%
5      1      2.7%
6      1      2.7%
7      1      2.7%
8      1      2.7%
9      1      2.7%
10     1      2.7%

Maximum 10 values

Value  Count  Frequency (%)
37     1      2.7%
36     1      2.7%
35     1      2.7%
34     1      2.7%
33     1      2.7%
32     1      2.7%
31     1      2.7%
30     1      2.7%
29     1      2.7%
28     1      2.7%

컬럼영문명 (column English name)
Text

UNIQUE

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B

Length

Max length: 10
Median length: 9
Mean length: 6.8648649
Min length: 1

Characters and Unicode

Total characters: 254
Distinct characters: 25
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 100.0%

Sample

1st row: OBJECTID
2nd row: GU_NM
3rd row: HNR_NAM
4th row: MTC_AT
5th row: MASTERNO

Value              Count  Frequency (%)
objectid           1      2.7%
itm_ery            1      2.7%
tre_som            1      2.7%
rnk_lc_cn          1      2.7%
mge_lvl            1      2.7%
spy_sttn           1      2.7%
dme_sttn           1      2.7%
regist_no          1      2.7%
pss_man            1      2.7%
scncenm_nm         1      2.7%
Other values (27)  27     73.0%

Most occurring characters

Value              Count  Frequency (%)
_                  34     13.4%
T                  26     10.2%
N                  26     10.2%
M                  23     9.1%
E                  21     8.3%
R                  14     5.5%
S                  13     5.1%
G                  11     4.3%
O                  9      3.5%
D                  9      3.5%
Other values (15)  68     26.8%

Most occurring categories

Value                  Count  Frequency (%)
Uppercase Letter       220    86.6%
Connector Punctuation  34     13.4%

Most frequent character per category

Uppercase Letter
Value              Count  Frequency (%)
T                  26     11.8%
N                  26     11.8%
M                  23     10.5%
E                  21     9.5%
R                  14     6.4%
S                  13     5.9%
G                  11     5.0%
O                  9      4.1%
D                  9      4.1%
C                  9      4.1%
Other values (14)  59     26.8%

Connector Punctuation
Value  Count  Frequency (%)
_      34     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   220    86.6%
Common  34     13.4%

Most frequent character per script

Latin
Value              Count  Frequency (%)
T                  26     11.8%
N                  26     11.8%
M                  23     10.5%
E                  21     9.5%
R                  14     6.4%
S                  13     5.9%
G                  11     5.0%
O                  9      4.1%
D                  9      4.1%
C                  9      4.1%
Other values (14)  59     26.8%

Common
Value  Count  Frequency (%)
_      34     100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  254    100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
_                  34     13.4%
T                  26     10.2%
N                  26     10.2%
M                  23     9.1%
E                  21     8.3%
R                  14     5.5%
S                  13     5.1%
G                  11     4.3%
O                  9      3.5%
D                  9      3.5%
Other values (15)  68     26.8%

컬럼한글명 (column Korean name)
Text

UNIQUE

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B

Length

Max length: 8
Median length: 4
Mean length: 3.6486486
Min length: 2

Characters and Unicode

Total characters: 135
Distinct characters: 60
Distinct categories: 3
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
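
This column reports three categories rather than one because a few names mix Latin letters and spaces with Hangul. The two coordinate names shown in the last rows of the Sample section, "X 좌표" and "Y 좌표", appear to account for the Uppercase Letter and Space Separator counts on their own. A small sketch with the standard `unicodedata` module, for illustration only:

```python
import unicodedata
from collections import Counter

# Two values taken from the last rows of the Sample section.
names = ["X 좌표", "Y 좌표"]

# General-category codes: "Lu" = Uppercase Letter,
# "Zs" = Space Separator, "Lo" = Other Letter (Hangul syllables).
counts = Counter(unicodedata.category(ch) for name in names for ch in name)

print(dict(counts))
```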

Unique

Unique: 37
Unique (%): 100.0%

Sample

1st row: 고유번호 (unique number)
2nd row: 구명 (district name)
3rd row: 법정동명 (legal dong name)
4th row: 산지여부 (mountain lot flag)
5th row: 주지번 (main lot number)

Value              Count  Frequency (%)
좌표               2      5.1%
고유번호           1      2.6%
학명               1      2.6%
품계수종           1      2.6%
품계위치           1      2.6%
관리등급           1      2.6%
지원사항           1      2.6%
피해상태           1      2.6%
등록번호           1      2.6%
소유자             1      2.6%
Other values (28)  28     71.8%

Most occurring characters

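Value              Count  Frequency (%)
(not captured)     7      5.2%
(not captured)     7      5.2%
(not captured)     6      4.4%
(not captured)     6      4.4%
(not captured)     6      4.4%
(not captured)     6      4.4%
(not captured)     5      3.7%
(not captured)     5      3.7%
(not captured)     4      3.0%
(not captured)     4      3.0%
Other values (50)  79     58.5%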

Most occurring categories

Value             Count  Frequency (%)
Other Letter      131    97.0%
Space Separator   2      1.5%
Uppercase Letter  2      1.5%

Most frequent character per category

Other Letter
Value              Count  Frequency (%)
(not captured)     7      5.3%
(not captured)     7      5.3%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     5      3.8%
(not captured)     5      3.8%
(not captured)     4      3.1%
(not captured)     4      3.1%
Other values (47)  75     57.3%

Uppercase Letter
Value  Count  Frequency (%)
X      1      50.0%
Y      1      50.0%

Space Separator
Value    Count  Frequency (%)
(space)  2      100.0%

Most occurring scripts

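Value   Count  Frequency (%)
Hangul  131    97.0%
Common  2      1.5%
Latin   2      1.5%

Most frequent character per script

Hangul
Value              Count  Frequency (%)
(not captured)     7      5.3%
(not captured)     7      5.3%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     5      3.8%
(not captured)     5      3.8%
(not captured)     4      3.1%
(not captured)     4      3.1%
Other values (47)  75     57.3%

Latin
Value  Count  Frequency (%)
X      1      50.0%
Y      1      50.0%

Common
Value    Count  Frequency (%)
(space)  2      100.0%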

Most occurring blocks

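Value   Count  Frequency (%)
Hangul  131    97.0%
ASCII   4      3.0%

Most frequent character per block

Hangul
Value              Count  Frequency (%)
(not captured)     7      5.3%
(not captured)     7      5.3%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     6      4.6%
(not captured)     5      3.8%
(not captured)     5      3.8%
(not captured)     4      3.1%
(not captured)     4      3.1%
Other values (47)  75     57.3%

ASCII
Value    Count  Frequency (%)
(space)  2      50.0%
X        1      25.0%
Y        1      25.0%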

데이터타입 (data type)
Categorical

Distinct: 3
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B

Value      Count
NVARCHAR2  24
NUMBER     12
DATE       1

Length

Max length: 9
Median length: 9
Mean length: 7.8918919
Min length: 4
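
The length statistics follow directly from the three category labels and their counts as reported under Common Values. A minimal sketch of that calculation, with illustrative variable names:

```python
# Value counts as reported under Common Values for 데이터타입.
value_counts = {"NVARCHAR2": 24, "NUMBER": 12, "DATE": 1}

total = sum(value_counts.values())  # 37 observations

# Mean label length, weighted by how often each label occurs.
mean_length = sum(len(v) * c for v, c in value_counts.items()) / total

# Expand to one length per observation to read off median, min, and max.
lengths = sorted(len(v) for v, c in value_counts.items() for _ in range(c))
median_length = lengths[len(lengths) // 2]  # odd count, so the middle element

print(round(mean_length, 7), median_length, lengths[0], lengths[-1])
```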

Unique

Unique: 1
Unique (%): 2.7%

Sample

1st row: NUMBER
2nd row: NVARCHAR2
3rd row: NVARCHAR2
4th row: NVARCHAR2
5th row: NVARCHAR2

Common Values

Value      Count  Frequency (%)
NVARCHAR2  24     64.9%
NUMBER     12     32.4%
DATE       1      2.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value      Count  Frequency (%)
nvarchar2  24     64.9%
number     12     32.4%
date       1      2.7%

길이 (length)
Unsupported

MISSING, REJECTED, UNSUPPORTED

Missing: 2
Missing (%): 5.4%
Memory size: 424.0 B
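
The Sample section shows this column holding plain integers such as 252, comma-separated pairs such as 38,8, and NaN. That mix is a plausible reason the profiler could not infer a single type, though the report does not say so. One possible cleaning step is sketched below; `parse_length` is a hypothetical helper, and reading "38,8" as a (precision, scale) pair is an assumption based on how NUMBER columns are commonly declared.

```python
import math
from typing import Optional, Tuple

def parse_length(raw) -> Optional[Tuple[int, int]]:
    """Return (length_or_precision, scale), or None for a missing value.

    Hypothetical helper: the name and the (precision, scale) reading of
    values like "38,8" are assumptions, not taken from the report.
    """
    if raw is None or (isinstance(raw, float) and math.isnan(raw)):
        return None
    if isinstance(raw, float) and raw.is_integer():
        raw = int(raw)  # e.g. 50.0 read from a float column becomes 50
    text = str(raw).strip()
    if not text or text.lower() in {"nan", "<na>"}:
        return None
    head, _, tail = text.partition(",")
    return int(head), (int(tail) if tail else 0)

# Values taken from the Sample section.
for raw in ["252", "38,8", float("nan")]:
    print(repr(raw), "->", parse_length(raw))
```

Splitting the value into two integer fields would give the profiler numeric columns it can summarize, at the cost of deciding what scale to record for plain lengths (0 in this sketch).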

Null
Boolean

HIGH CORRELATION

Distinct: 2
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 165.0 B

Value  Count  Frequency (%)
True   29     78.4%
False  8      21.6%

Interactions

[Figure: interactions plot]

Correlations

            컬럼순서  컬럼영문명  컬럼한글명  데이터타입  Null
컬럼순서    1.000     1.000       1.000       0.420       0.616
컬럼영문명  1.000     1.000       1.000       1.000       1.000
컬럼한글명  1.000     1.000       1.000       1.000       1.000
데이터타입  0.420     1.000       1.000       1.000       0.000
Null        0.616     1.000       1.000       0.000       1.000

            데이터타입  Null
데이터타입  1.000       0.000
Null        0.000       1.000

            컬럼순서  데이터타입  Null
컬럼순서    1.000     0.105       0.541
데이터타입  0.105     1.000       0.000
Null        0.541     0.000       1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

    테이블영문명            테이블한글명      컬럼순서  컬럼영문명  컬럼한글명    데이터타입  길이  Null
0   TN_PARK_ND_PVTLND_WDPT  공원및사유지수목  1         OBJECTID    고유번호      NUMBER      NaN   N
1   <NA>                    <NA>              2         GU_NM       구명          NVARCHAR2   252   Y
2   <NA>                    <NA>              3         HNR_NAM     법정동명      NVARCHAR2   50    Y
3   <NA>                    <NA>              4         MTC_AT      산지여부      NVARCHAR2   1     Y
4   <NA>                    <NA>              5         MASTERNO    주지번        NVARCHAR2   4     Y
5   <NA>                    <NA>              6         SLAVENO     부지번        NVARCHAR2   4     Y
6   <NA>                    <NA>              7         NEADRES_NM  새주소명      NVARCHAR2   90    Y
7   <NA>                    <NA>              8         TRE_IDN     수목고유번호  NVARCHAR2   50    Y
8   <NA>                    <NA>              9         GU_NO       구번호        NVARCHAR2   200   Y
9   <NA>                    <NA>              10        DONG_NM     동명          NVARCHAR2   200   N

Last rows

    테이블영문명  테이블한글명  컬럼순서  컬럼영문명  컬럼한글명  데이터타입  길이  Null
27  <NA>          <NA>          28        PSS_MAN     소유자      NVARCHAR2   250   Y
28  <NA>          <NA>          29        SCNCENM_NM  학명        NVARCHAR2   750   Y
29  <NA>          <NA>          30        VTN_ERY     식생활력    NUMBER      38,8  Y
30  <NA>          <NA>          31        ITM_LVL     품계등급    NUMBER      38,8  Y
31  <NA>          <NA>          32        MGE_MAN     관리자      NVARCHAR2   250   Y
32  <NA>          <NA>          33        MGE_ORG     관리기관    NVARCHAR2   50    Y
33  <NA>          <NA>          34        CREAT_DE    생성일      DATE        NaN   Y
34  <NA>          <NA>          35        PO_FE_NM    사진파일명  NVARCHAR2   30    Y
35  <NA>          <NA>          36        X           X 좌표      NVARCHAR2   11    Y
36  <NA>          <NA>          37        Y           Y 좌표      NVARCHAR2   11    Y