Overview

Dataset statistics

Number of variables7
Number of observations68
Missing cells0
Missing cells (%)0.0%
Duplicate rows5
Duplicate rows (%)7.4%
Total size in memory3.9 KiB
Average record size in memory58.9 B

Variable types

Categorical4
Text2
Numeric1

Dataset

Description파일 다운로드
Author강서구
URLhttps://data.seoul.go.kr/dataList/OA-21803/F/1/datasetView.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
Dataset has 5 (7.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 04:17:58.399187
Analysis finished2023-12-11 04:17:59.074507
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
서울특별시
68 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 68
100.0%

Length

2023-12-11T13:17:59.145798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T13:17:59.249882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 68
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
강서구
68 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강서구
2nd row강서구
3rd row강서구
4th row강서구
5th row강서구

Common Values

ValueCountFrequency (%)
강서구 68
100.0%

Length

2023-12-11T13:17:59.363611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T13:17:59.575026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강서구 68
100.0%
Distinct54
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-11T13:17:59.803856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length10
Mean length5.9705882
Min length2

Characters and Unicode

Total characters406
Distinct characters165
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)70.6%

Sample

1st row마곡중앙광장
2nd row공판장동
3rd row관리,트럭판매동
4th row점포동
5th row청과물동
ValueCountFrequency (%)
미기재 10
 
12.3%
홈플러스 2
 
2.5%
관리,트럭판매동 2
 
2.5%
메이필드호텔 2
 
2.5%
호텔 2
 
2.5%
강서점 2
 
2.5%
가양점 2
 
2.5%
김포공항 2
 
2.5%
점포동 2
 
2.5%
청과물동 2
 
2.5%
Other values (51) 53
65.4%
2023-12-11T13:18:00.318114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
3.2%
13
 
3.2%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
10
 
2.5%
10
 
2.5%
8
 
2.0%
Other values (155) 298
73.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 378
93.1%
Space Separator 13
 
3.2%
Uppercase Letter 10
 
2.5%
Decimal Number 3
 
0.7%
Other Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
3.4%
12
 
3.2%
11
 
2.9%
11
 
2.9%
10
 
2.6%
10
 
2.6%
10
 
2.6%
10
 
2.6%
8
 
2.1%
7
 
1.9%
Other values (141) 276
73.0%
Uppercase Letter
ValueCountFrequency (%)
C 2
20.0%
T 1
10.0%
A 1
10.0%
I 1
10.0%
K 1
10.0%
J 1
10.0%
V 1
10.0%
G 1
10.0%
N 1
10.0%
Decimal Number
ValueCountFrequency (%)
8 1
33.3%
2 1
33.3%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 378
93.1%
Common 18
 
4.4%
Latin 10
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
3.4%
12
 
3.2%
11
 
2.9%
11
 
2.9%
10
 
2.6%
10
 
2.6%
10
 
2.6%
10
 
2.6%
8
 
2.1%
7
 
1.9%
Other values (141) 276
73.0%
Latin
ValueCountFrequency (%)
C 2
20.0%
T 1
10.0%
A 1
10.0%
I 1
10.0%
K 1
10.0%
J 1
10.0%
V 1
10.0%
G 1
10.0%
N 1
10.0%
Common
ValueCountFrequency (%)
13
72.2%
, 2
 
11.1%
8 1
 
5.6%
2 1
 
5.6%
1 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 378
93.1%
ASCII 28
 
6.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13
46.4%
C 2
 
7.1%
, 2
 
7.1%
T 1
 
3.6%
A 1
 
3.6%
I 1
 
3.6%
8 1
 
3.6%
2 1
 
3.6%
1 1
 
3.6%
K 1
 
3.6%
Other values (4) 4
 
14.3%
Hangul
ValueCountFrequency (%)
13
 
3.4%
12
 
3.2%
11
 
2.9%
11
 
2.9%
10
 
2.6%
10
 
2.6%
10
 
2.6%
10
 
2.6%
8
 
2.1%
7
 
1.9%
Other values (141) 276
73.0%

시설 연락처
Categorical

Distinct24
Distinct (%)35.3%
Missing0
Missing (%)0.0%
Memory size676.0 B
미기재
45 
02-2063-2080
 
1
02-2101-1053
 
1
02-6733-6000
 
1
02-2007-1271
 
1
Other values (19)
19 

Length

Max length12
Median length3
Mean length6.0441176
Min length3

Unique

Unique23 ?
Unique (%)33.8%

Sample

1st row미기재
2nd row미기재
3rd row미기재
4th row미기재
5th row미기재

Common Values

ValueCountFrequency (%)
미기재 45
66.2%
02-2063-2080 1
 
1.5%
02-2101-1053 1
 
1.5%
02-6733-6000 1
 
1.5%
02-2007-1271 1
 
1.5%
02-2602-2801 1
 
1.5%
02-2660-8121 1
 
1.5%
02-1577-7582 1
 
1.5%
02-2667-9000 1
 
1.5%
02-2668-3600 1
 
1.5%
Other values (14) 14
 
20.6%

Length

2023-12-11T13:18:00.501974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미기재 45
66.2%
02-2063-2080 1
 
1.5%
02-3663-6562 1
 
1.5%
02-2663-1978 1
 
1.5%
02-6116-4000 1
 
1.5%
02-2660-7500 1
 
1.5%
02-6946-7000 1
 
1.5%
02-2602-6002 1
 
1.5%
02-2604-9229 1
 
1.5%
02-6116-1000 1
 
1.5%
Other values (14) 14
 
20.6%

주용도
Categorical

Distinct6
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size676.0 B
판매시설
21 
여객용운수시설
18 
종교시설
11 
관광숙박시설
10 
문화및집회시설

Length

Max length7
Median length4
Mean length5.2647059
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row판매시설
2nd row판매시설
3rd row판매시설
4th row판매시설
5th row판매시설

Common Values

ValueCountFrequency (%)
판매시설 21
30.9%
여객용운수시설 18
26.5%
종교시설 11
16.2%
관광숙박시설 10
14.7%
문화및집회시설 4
 
5.9%
종합병원 4
 
5.9%

Length

2023-12-11T13:18:00.652194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T13:18:00.781169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
판매시설 21
30.9%
여객용운수시설 18
26.5%
종교시설 11
16.2%
관광숙박시설 10
14.7%
문화및집회시설 4
 
5.9%
종합병원 4
 
5.9%

바닥면적(m2)
Real number (ℝ)

Distinct63
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36709.342
Minimum5003.4
Maximum316151.96
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size744.0 B
2023-12-11T13:18:00.960717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5003.4
5-th percentile6154.6245
Q19389.675
median15206.13
Q329832.22
95-th percentile121818.36
Maximum316151.96
Range311148.56
Interquartile range (IQR)20442.545

Descriptive statistics

Standard deviation60399.548
Coefficient of variation (CV)1.6453454
Kurtosis13.275819
Mean36709.342
Median Absolute Deviation (MAD)6702.275
Skewness3.5229035
Sum2496235.2
Variance3.6481054 × 109
MonotonicityNot monotonic
2023-12-11T13:18:01.172699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
21133.86 2
 
2.9%
15681.74 2
 
2.9%
29832.22 2
 
2.9%
67588.48 2
 
2.9%
21108.14 2
 
2.9%
7981.7 1
 
1.5%
5398.87 1
 
1.5%
15537.28 1
 
1.5%
14650.92 1
 
1.5%
8682.0 1
 
1.5%
Other values (53) 53
77.9%
ValueCountFrequency (%)
5003.4 1
1.5%
5015.3 1
1.5%
5398.87 1
1.5%
6141.79 1
1.5%
6178.46 1
1.5%
6450.74 1
1.5%
6919.49 1
1.5%
7118.22 1
1.5%
7296.18 1
1.5%
7692.22 1
1.5%
ValueCountFrequency (%)
316151.96 1
1.5%
315098.0 1
1.5%
220179.85 1
1.5%
128313.07 1
1.5%
109756.77 1
1.5%
93863.12 1
1.5%
90313.7 1
1.5%
80922.07 1
1.5%
68534.85 1
1.5%
67588.48 2
2.9%
Distinct47
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-11T13:18:01.514361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length20.161765
Min length15

Characters and Unicode

Total characters1371
Distinct characters47
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)57.4%

Sample

1st row서울특별시 강서구 마곡동 767-2번지
2nd row서울특별시 강서구 외발산동 424번지
3rd row서울특별시 강서구 외발산동 427번지
4th row서울특별시 강서구 외발산동 427번지
5th row서울특별시 강서구 외발산동 427번지
ValueCountFrequency (%)
강서구 68
25.4%
서울특별시 64
23.9%
공항동 10
 
3.7%
외발산동 10
 
3.7%
등촌동 9
 
3.4%
화곡동 8
 
3.0%
1373번지 7
 
2.6%
427번지 6
 
2.2%
내발산동 5
 
1.9%
마곡동 5
 
1.9%
Other values (54) 76
28.4%
2023-12-11T13:18:02.100622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
200
 
14.6%
132
 
9.6%
68
 
5.0%
68
 
5.0%
68
 
5.0%
64
 
4.7%
64
 
4.7%
64
 
4.7%
64
 
4.7%
63
 
4.6%
Other values (37) 516
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 885
64.6%
Decimal Number 251
 
18.3%
Space Separator 200
 
14.6%
Dash Punctuation 28
 
2.0%
Open Punctuation 4
 
0.3%
Close Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
132
14.9%
68
7.7%
68
7.7%
68
7.7%
64
 
7.2%
64
 
7.2%
64
 
7.2%
64
 
7.2%
63
 
7.1%
63
 
7.1%
Other values (23) 167
18.9%
Decimal Number
ValueCountFrequency (%)
7 38
15.1%
1 35
13.9%
2 29
11.6%
4 29
11.6%
3 27
10.8%
6 27
10.8%
8 22
8.8%
9 16
6.4%
0 15
 
6.0%
5 13
 
5.2%
Space Separator
ValueCountFrequency (%)
200
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 885
64.6%
Common 486
35.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
132
14.9%
68
7.7%
68
7.7%
68
7.7%
64
 
7.2%
64
 
7.2%
64
 
7.2%
64
 
7.2%
63
 
7.1%
63
 
7.1%
Other values (23) 167
18.9%
Common
ValueCountFrequency (%)
200
41.2%
7 38
 
7.8%
1 35
 
7.2%
2 29
 
6.0%
4 29
 
6.0%
- 28
 
5.8%
3 27
 
5.6%
6 27
 
5.6%
8 22
 
4.5%
9 16
 
3.3%
Other values (4) 35
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 885
64.6%
ASCII 486
35.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
200
41.2%
7 38
 
7.8%
1 35
 
7.2%
2 29
 
6.0%
4 29
 
6.0%
- 28
 
5.8%
3 27
 
5.6%
6 27
 
5.6%
8 22
 
4.5%
9 16
 
3.3%
Other values (4) 35
 
7.2%
Hangul
ValueCountFrequency (%)
132
14.9%
68
7.7%
68
7.7%
68
7.7%
64
 
7.2%
64
 
7.2%
64
 
7.2%
64
 
7.2%
63
 
7.1%
63
 
7.1%
Other values (23) 167
18.9%

Interactions

2023-12-11T13:17:58.753237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T13:18:02.239848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설명시설 연락처주용도바닥면적(m2)소재지
시설명1.0001.0001.0000.0000.997
시설 연락처1.0001.0000.7750.8860.996
주용도1.0000.7751.0000.2800.986
바닥면적(m2)0.0000.8860.2801.0000.728
소재지0.9970.9960.9860.7281.000
2023-12-11T13:18:02.385897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주용도시설 연락처
주용도1.0000.359
시설 연락처0.3591.000
2023-12-11T13:18:02.519575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
바닥면적(m2)시설 연락처주용도
바닥면적(m2)1.0000.4910.096
시설 연락처0.4911.0000.359
주용도0.0960.3591.000

Missing values

2023-12-11T13:17:58.883363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T13:17:59.025150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명시설명시설 연락처주용도바닥면적(m2)소재지
0서울특별시강서구마곡중앙광장미기재판매시설21133.86서울특별시 강서구 마곡동 767-2번지
1서울특별시강서구공판장동미기재판매시설21108.14서울특별시 강서구 외발산동 424번지
2서울특별시강서구관리,트럭판매동미기재판매시설15681.74서울특별시 강서구 외발산동 427번지
3서울특별시강서구점포동미기재판매시설29832.22서울특별시 강서구 외발산동 427번지
4서울특별시강서구청과물동미기재판매시설67588.48서울특별시 강서구 외발산동 427번지
5서울특별시강서구청계빌딩미기재판매시설5003.4서울특별시 강서구 가양동 1479-10번지
6서울특별시강서구홈플러스 가양점02-2063-2080판매시설44885.33서울특별시 강서구 가양동 18-24번지
7서울특별시강서구이마트 가양점02-2101-1053판매시설53092.79서울특별시 강서구 가양동 449-19번지
8서울특별시강서구미기재미기재여객용운수시설128313.07서울특별시 강서구 공항동 1370번지
9서울특별시강서구공항청사및항공지원센터미기재여객용운수시설54218.29서울특별시 강서구 공항동 1373번지
시도명시군구명시설명시설 연락처주용도바닥면적(m2)소재지
58서울특별시강서구강남장로교회미기재종교시설8880.41서울특별시 강서구 화곡동 957-10번지
59서울특별시강서구롯데시티호텔 김포공항02-6116-1000관광숙박시설11312.0서울특별시 강서구 방화동 886번지
60서울특별시강서구칼튼호텔02-2604-9229관광숙박시설6141.79서울특별시 강서구 화곡동 1110-3번지
61서울특별시강서구호텔 에스02-2602-6002관광숙박시설6450.74서울특별시 강서구 화곡동 918-23번지
62서울특별시강서구코트야드 메리어트 서울보타닉파크02-6946-7000관광숙박시설19080.03서울특별시 강서구 마곡동 766-1번지
63서울특별시강서구우리들병원02-2660-7500종합병원11837.02서울특별시 강서구 하늘길 70
64서울특별시강서구롯데몰 김포공항02-6116-4000판매시설315098.0강서구 하늘길 38(방화동)
65서울특별시강서구삼정프라자02-2663-1978판매시설20992.0강서구 방화동로 126(방화동)
66서울특별시강서구스페이스프라자02-3663-6562판매시설21851.0강서구 공항대로71길 49(염창동)
67서울특별시강서구강서 뉴타워02-2605-3334판매시설27284.0강서구 화곡로18길 24(화곡동

Duplicate rows

Most frequently occurring

시도명시군구명시설명시설 연락처주용도바닥면적(m2)소재지# duplicates
0서울특별시강서구공판장동미기재판매시설21108.14서울특별시 강서구 외발산동 424번지2
1서울특별시강서구관리,트럭판매동미기재판매시설15681.74서울특별시 강서구 외발산동 427번지2
2서울특별시강서구마곡중앙광장미기재판매시설21133.86서울특별시 강서구 마곡동 767-2번지2
3서울특별시강서구점포동미기재판매시설29832.22서울특별시 강서구 외발산동 427번지2
4서울특별시강서구청과물동미기재판매시설67588.48서울특별시 강서구 외발산동 427번지2