Overview

Dataset statistics

Number of variables4
Number of observations734
Missing cells14
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.8 KiB
Average record size in memory33.2 B

Variable types

Categorical2
Text1
Numeric1

Dataset

Description자치구명,법정동명,업태명,업소수
Author송파구
URLhttps://data.seoul.go.kr/dataList/OA-11453/S/1/datasetView.do

Alerts

자치구명 has constant value ""Constant
업태명 has 14 (1.9%) missing valuesMissing

Reproduction

Analysis started2024-05-11 05:23:44.713857
Analysis finished2024-05-11 05:23:48.684530
Duration3.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
송파구
734 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row송파구
2nd row송파구
3rd row송파구
4th row송파구
5th row송파구

Common Values

ValueCountFrequency (%)
송파구 734
100.0%

Length

2024-05-11T14:23:48.912407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:23:49.125273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
송파구 734
100.0%

법정동명
Categorical

Distinct17
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
잠실동
70 
가락동
69 
문정동
61 
방이동
61 
석촌동
60 
Other values (12)
413 

Length

Max length4
Median length3
Mean length3.0013624
Min length3

Unique

Unique3 ?
Unique (%)0.4%

Sample

1st row역삼동
2nd row역삼동
3rd row대치동
4th row<NA>
5th row잠실동

Common Values

ValueCountFrequency (%)
잠실동 70
9.5%
가락동 69
9.4%
문정동 61
8.3%
방이동 61
8.3%
석촌동 60
8.2%
송파동 57
 
7.8%
신천동 56
 
7.6%
마천동 52
 
7.1%
삼전동 52
 
7.1%
거여동 50
 
6.8%
Other values (7) 146
19.9%

Length

2024-05-11T14:23:49.333030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
잠실동 70
9.5%
가락동 69
9.4%
문정동 61
8.3%
방이동 61
8.3%
석촌동 60
8.2%
송파동 57
 
7.8%
신천동 56
 
7.6%
삼전동 52
 
7.1%
마천동 52
 
7.1%
거여동 50
 
6.8%
Other values (7) 146
19.9%

업태명
Text

MISSING 

Distinct83
Distinct (%)11.5%
Missing14
Missing (%)1.9%
Memory size5.9 KiB
2024-05-11T14:23:49.795503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length5.8138889
Min length2

Characters and Unicode

Total characters4186
Distinct characters162
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)1.2%

Sample

1st row영업장판매
2nd row방문판매
3rd row식품등 수입판매업
4th row한식
5th row중국식
ValueCountFrequency (%)
기타 59
 
7.4%
식품제조가공업 22
 
2.8%
집단급식소 22
 
2.8%
패스트푸드 19
 
2.4%
영업장판매 15
 
1.9%
식품등 14
 
1.8%
수입판매업 14
 
1.8%
방문판매 14
 
1.8%
편의점 13
 
1.6%
식품판매업 13
 
1.6%
Other values (74) 592
74.3%
2024-05-11T14:23:50.558804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
291
 
7.0%
252
 
6.0%
192
 
4.6%
181
 
4.3%
142
 
3.4%
139
 
3.3%
) 85
 
2.0%
( 85
 
2.0%
78
 
1.9%
78
 
1.9%
Other values (152) 2663
63.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3886
92.8%
Close Punctuation 85
 
2.0%
Open Punctuation 85
 
2.0%
Space Separator 77
 
1.8%
Other Punctuation 53
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
291
 
7.5%
252
 
6.5%
192
 
4.9%
181
 
4.7%
142
 
3.7%
139
 
3.6%
78
 
2.0%
78
 
2.0%
76
 
2.0%
64
 
1.6%
Other values (146) 2393
61.6%
Other Punctuation
ValueCountFrequency (%)
/ 35
66.0%
, 13
 
24.5%
. 5
 
9.4%
Close Punctuation
ValueCountFrequency (%)
) 85
100.0%
Open Punctuation
ValueCountFrequency (%)
( 85
100.0%
Space Separator
ValueCountFrequency (%)
77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3886
92.8%
Common 300
 
7.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
291
 
7.5%
252
 
6.5%
192
 
4.9%
181
 
4.7%
142
 
3.7%
139
 
3.6%
78
 
2.0%
78
 
2.0%
76
 
2.0%
64
 
1.6%
Other values (146) 2393
61.6%
Common
ValueCountFrequency (%)
) 85
28.3%
( 85
28.3%
77
25.7%
/ 35
11.7%
, 13
 
4.3%
. 5
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3886
92.8%
ASCII 300
 
7.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
291
 
7.5%
252
 
6.5%
192
 
4.9%
181
 
4.7%
142
 
3.7%
139
 
3.6%
78
 
2.0%
78
 
2.0%
76
 
2.0%
64
 
1.6%
Other values (146) 2393
61.6%
ASCII
ValueCountFrequency (%)
) 85
28.3%
( 85
28.3%
77
25.7%
/ 35
11.7%
, 13
 
4.3%
. 5
 
1.7%

업소수
Real number (ℝ)

Distinct121
Distinct (%)16.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.126703
Minimum1
Maximum686
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.6 KiB
2024-05-11T14:23:50.823225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median6
Q322
95-th percentile114.4
Maximum686
Range685
Interquartile range (IQR)20

Descriptive statistics

Standard deviation59.175545
Coefficient of variation (CV)2.264945
Kurtosis38.800137
Mean26.126703
Median Absolute Deviation (MAD)5
Skewness5.3529178
Sum19177
Variance3501.7452
MonotonicityNot monotonic
2024-05-11T14:23:51.122995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 130
17.7%
2 75
 
10.2%
3 64
 
8.7%
5 37
 
5.0%
4 36
 
4.9%
6 29
 
4.0%
7 23
 
3.1%
8 21
 
2.9%
9 18
 
2.5%
10 18
 
2.5%
Other values (111) 283
38.6%
ValueCountFrequency (%)
1 130
17.7%
2 75
10.2%
3 64
8.7%
4 36
 
4.9%
5 37
 
5.0%
6 29
 
4.0%
7 23
 
3.1%
8 21
 
2.9%
9 18
 
2.5%
10 18
 
2.5%
ValueCountFrequency (%)
686 1
0.1%
489 1
0.1%
454 1
0.1%
451 1
0.1%
417 1
0.1%
352 1
0.1%
329 1
0.1%
290 1
0.1%
269 1
0.1%
248 1
0.1%

Interactions

2024-05-11T14:23:48.083735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T14:23:51.345772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동명업태명업소수
법정동명1.0000.0000.000
업태명0.0001.0000.230
업소수0.0000.2301.000
2024-05-11T14:23:51.533118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소수법정동명
업소수1.0000.000
법정동명0.0001.000

Missing values

2024-05-11T14:23:48.418605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T14:23:48.575513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치구명법정동명업태명업소수
0송파구역삼동영업장판매1
1송파구역삼동방문판매1
2송파구대치동<NA>1
3송파구<NA>식품등 수입판매업6
4송파구잠실동한식417
5송파구잠실동중국식33
6송파구잠실동경양식53
7송파구잠실동일식61
8송파구잠실동분식74
9송파구잠실동뷔페식3
자치구명법정동명업태명업소수
724송파구마천동건강기능식품수입업2
725송파구마천동영업장판매30
726송파구마천동방문판매7
727송파구마천동전화권유판매1
728송파구마천동전자상거래(통신판매업)51
729송파구마천동<NA>1
730송파구마천동다단계판매7
731송파구마천동기타 건강기능식품일반판매업2
732송파구마천동건강기능식품유통전문판매업11
733송파구성내동영업장판매3