Overview

Dataset statistics

Number of variables4
Number of observations151
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.0 KiB
Average record size in memory33.9 B

Variable types

Categorical3
Numeric1

Dataset

Description자치구명,법정동명,업종명,업소수
Author강서구
URLhttps://data.seoul.go.kr/dataList/OA-9991/S/1/datasetView.do

Alerts

자치구명 has constant value ""Constant

Reproduction

Analysis started2024-05-18 07:33:55.275265
Analysis finished2024-05-18 07:33:56.657566
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
강서구
151 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강서구
2nd row강서구
3rd row강서구
4th row강서구
5th row강서구

Common Values

ValueCountFrequency (%)
강서구 151
100.0%

Length

2024-05-18T16:33:56.925719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T16:33:57.240070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강서구 151
100.0%

법정동명
Categorical

Distinct11
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
화곡동
22 
등촌동
20 
마곡동
19 
내발산동
19 
방화동
18 
Other values (6)
53 

Length

Max length4
Median length3
Mean length3.1589404
Min length3

Unique

Unique2 ?
Unique (%)1.3%

Sample

1st row염창동
2nd row염창동
3rd row염창동
4th row염창동
5th row염창동

Common Values

ValueCountFrequency (%)
화곡동 22
14.6%
등촌동 20
13.2%
마곡동 19
12.6%
내발산동 19
12.6%
방화동 18
11.9%
가양동 17
11.3%
공항동 15
9.9%
염창동 14
9.3%
외발산동 5
 
3.3%
개화동 1
 
0.7%

Length

2024-05-18T16:33:57.566864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화곡동 22
14.6%
등촌동 20
13.2%
마곡동 19
12.6%
내발산동 19
12.6%
방화동 18
11.9%
가양동 17
11.3%
공항동 15
9.9%
염창동 14
9.3%
외발산동 5
 
3.3%
개화동 1
 
0.7%

업종명
Categorical

Distinct24
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
위생관리용역업
 
10
이용업
 
9
일반미용업
 
9
피부미용업
 
9
숙박업(일반)
 
8
Other values (19)
106 

Length

Max length23
Median length16
Mean length9.1059603
Min length3

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row숙박업(일반)
2nd row목욕장업
3rd row이용업
4th row세탁업
5th row위생관리용역업

Common Values

ValueCountFrequency (%)
위생관리용역업 10
 
6.6%
이용업 9
 
6.0%
일반미용업 9
 
6.0%
피부미용업 9
 
6.0%
숙박업(일반) 8
 
5.3%
세탁업 8
 
5.3%
공중이용시설 8
 
5.3%
종합미용업 8
 
5.3%
네일미용업 8
 
5.3%
화장ㆍ분장 미용업 8
 
5.3%
Other values (14) 66
43.7%

Length

2024-05-18T16:33:57.995505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미용업 45
17.5%
화장ㆍ분장 40
15.6%
네일미용업 39
15.2%
피부미용업 38
14.8%
일반미용업 32
12.5%
위생관리용역업 10
 
3.9%
이용업 9
 
3.5%
숙박업(일반 8
 
3.1%
세탁업 8
 
3.1%
공중이용시설 8
 
3.1%
Other values (4) 20
7.8%

업소수
Real number (ℝ)

Distinct46
Distinct (%)30.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.913907
Minimum1
Maximum405
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-05-18T16:33:58.406393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q316.5
95-th percentile81.5
Maximum405
Range404
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation43.901396
Coefficient of variation (CV)2.2045596
Kurtosis41.188557
Mean19.913907
Median Absolute Deviation (MAD)3
Skewness5.5260722
Sum3007
Variance1927.3325
MonotonicityNot monotonic
2024-05-18T16:33:58.847545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
1 35
23.2%
2 20
13.2%
3 12
 
7.9%
4 11
 
7.3%
13 5
 
3.3%
5 5
 
3.3%
12 4
 
2.6%
7 4
 
2.6%
10 4
 
2.6%
35 3
 
2.0%
Other values (36) 48
31.8%
ValueCountFrequency (%)
1 35
23.2%
2 20
13.2%
3 12
 
7.9%
4 11
 
7.3%
5 5
 
3.3%
6 3
 
2.0%
7 4
 
2.6%
8 1
 
0.7%
9 2
 
1.3%
10 4
 
2.6%
ValueCountFrequency (%)
405 1
0.7%
198 1
0.7%
144 1
0.7%
136 1
0.7%
115 1
0.7%
107 1
0.7%
106 1
0.7%
84 1
0.7%
79 1
0.7%
78 1
0.7%

Interactions

2024-05-18T16:33:55.540096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T16:33:59.133466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동명업종명업소수
법정동명1.0000.0000.000
업종명0.0001.0000.000
업소수0.0000.0001.000
2024-05-18T16:33:59.367670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동명업종명
법정동명1.0000.000
업종명0.0001.000
2024-05-18T16:33:59.604493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소수법정동명업종명
업소수1.0000.0000.000
법정동명0.0001.0000.000
업종명0.0000.0001.000

Missing values

2024-05-18T16:33:56.120072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T16:33:56.504272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치구명법정동명업종명업소수
0강서구염창동숙박업(일반)4
1강서구염창동목욕장업4
2강서구염창동이용업9
3강서구염창동세탁업12
4강서구염창동위생관리용역업2
5강서구염창동공중이용시설29
6강서구염창동일반미용업48
7강서구염창동피부미용업15
8강서구염창동종합미용업4
9강서구염창동네일미용업7
자치구명법정동명업종명업소수
141강서구방화동종합미용업5
142강서구방화동네일미용업5
143강서구방화동일반미용업, 피부미용업2
144강서구방화동피부미용업, 네일미용업2
145강서구방화동화장ㆍ분장 미용업2
146강서구방화동일반미용업, 화장ㆍ분장 미용업1
147강서구방화동네일미용업, 화장ㆍ분장 미용업4
148강서구방화동일반미용업, 네일미용업, 화장ㆍ분장 미용업1
149강서구개화동위생관리용역업3
150강서구과해동위생관리용역업1