Overview

Dataset statistics

Number of variables4
Number of observations285
Missing cells5
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.3 KiB
Average record size in memory33.5 B

Variable types

Categorical2
Text1
Numeric1

Dataset

Description자치구명,법정동명,업태명,업소수
Author노원구
URLhttps://data.seoul.go.kr/dataList/OA-10991/S/1/datasetView.do

Alerts

자치구명 has constant value ""Constant
업태명 has 5 (1.8%) missing valuesMissing

Reproduction

Analysis started2024-05-11 05:29:57.983137
Analysis finished2024-05-11 05:29:58.703187
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
노원구
285 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노원구
2nd row노원구
3rd row노원구
4th row노원구
5th row노원구

Common Values

ValueCountFrequency (%)
노원구 285
100.0%

Length

2024-05-11T14:29:58.804082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:29:58.937830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노원구 285
100.0%

법정동명
Categorical

Distinct7
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
상계동
68 
공릉동
57 
월계동
56 
중계동
53 
하계동
49 
Other values (2)
 
2

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique2 ?
Unique (%)0.7%

Sample

1st row월계동
2nd row월계동
3rd row월계동
4th row월계동
5th row월계동

Common Values

ValueCountFrequency (%)
상계동 68
23.9%
공릉동 57
20.0%
월계동 56
19.6%
중계동 53
18.6%
하계동 49
17.2%
신흥동 1
 
0.4%
대월면 1
 
0.4%

Length

2024-05-11T14:29:59.070817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:29:59.281104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상계동 68
23.9%
공릉동 57
20.0%
월계동 56
19.6%
중계동 53
18.6%
하계동 49
17.2%
신흥동 1
 
0.4%
대월면 1
 
0.4%

업태명
Text

MISSING 

Distinct71
Distinct (%)25.4%
Missing5
Missing (%)1.8%
Memory size2.4 KiB
2024-05-11T14:29:59.593324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length5.6285714
Min length2

Characters and Unicode

Total characters1576
Distinct characters155
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)4.6%

Sample

1st row한식
2nd row중국식
3rd row경양식
4th row일식
5th row분식
ValueCountFrequency (%)
기타 18
 
5.9%
패스트푸드 10
 
3.3%
식품제조가공업 9
 
3.0%
집단급식소 7
 
2.3%
일반조리판매 6
 
2.0%
제과점영업 6
 
2.0%
식품등 5
 
1.7%
학교 5
 
1.7%
병원 5
 
1.7%
사회복지시설 5
 
1.7%
Other values (61) 227
74.9%
2024-05-11T14:30:00.171842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
104
 
6.6%
88
 
5.6%
69
 
4.4%
65
 
4.1%
47
 
3.0%
45
 
2.9%
33
 
2.1%
) 31
 
2.0%
( 31
 
2.0%
31
 
2.0%
Other values (145) 1032
65.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1471
93.3%
Close Punctuation 31
 
2.0%
Open Punctuation 31
 
2.0%
Space Separator 23
 
1.5%
Other Punctuation 20
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
7.1%
88
 
6.0%
69
 
4.7%
65
 
4.4%
47
 
3.2%
45
 
3.1%
33
 
2.2%
31
 
2.1%
25
 
1.7%
23
 
1.6%
Other values (140) 941
64.0%
Other Punctuation
ValueCountFrequency (%)
/ 15
75.0%
, 5
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1471
93.3%
Common 105
 
6.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
7.1%
88
 
6.0%
69
 
4.7%
65
 
4.4%
47
 
3.2%
45
 
3.1%
33
 
2.2%
31
 
2.1%
25
 
1.7%
23
 
1.6%
Other values (140) 941
64.0%
Common
ValueCountFrequency (%)
) 31
29.5%
( 31
29.5%
23
21.9%
/ 15
14.3%
, 5
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1471
93.3%
ASCII 105
 
6.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
104
 
7.1%
88
 
6.0%
69
 
4.7%
65
 
4.4%
47
 
3.2%
45
 
3.1%
33
 
2.2%
31
 
2.1%
25
 
1.7%
23
 
1.6%
Other values (140) 941
64.0%
ASCII
ValueCountFrequency (%)
) 31
29.5%
( 31
29.5%
23
21.9%
/ 15
14.3%
, 5
 
4.8%

업소수
Real number (ℝ)

Distinct81
Distinct (%)28.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.249123
Minimum1
Maximum732
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-05-11T14:30:00.423828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median7
Q327
95-th percentile125.6
Maximum732
Range731
Interquartile range (IQR)24

Descriptive statistics

Standard deviation64.068881
Coefficient of variation (CV)2.1904548
Kurtosis54.644254
Mean29.249123
Median Absolute Deviation (MAD)6
Skewness6.1365769
Sum8336
Variance4104.8215
MonotonicityNot monotonic
2024-05-11T14:30:00.720411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 44
 
15.4%
2 24
 
8.4%
3 23
 
8.1%
4 16
 
5.6%
6 15
 
5.3%
5 14
 
4.9%
7 7
 
2.5%
8 7
 
2.5%
11 7
 
2.5%
9 6
 
2.1%
Other values (71) 122
42.8%
ValueCountFrequency (%)
1 44
15.4%
2 24
8.4%
3 23
8.1%
4 16
 
5.6%
5 14
 
4.9%
6 15
 
5.3%
7 7
 
2.5%
8 7
 
2.5%
9 6
 
2.1%
10 4
 
1.4%
ValueCountFrequency (%)
732 1
0.4%
336 1
0.4%
321 1
0.4%
273 1
0.4%
228 1
0.4%
220 1
0.4%
200 1
0.4%
198 1
0.4%
185 1
0.4%
169 1
0.4%

Interactions

2024-05-11T14:29:58.362078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T14:30:00.930100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동명업태명업소수
법정동명1.0000.0000.000
업태명0.0001.0000.000
업소수0.0000.0001.000
2024-05-11T14:30:01.057709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소수법정동명
업소수1.0000.000
법정동명0.0001.000

Missing values

2024-05-11T14:29:58.543364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T14:29:58.654506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치구명법정동명업태명업소수
0노원구월계동한식169
1노원구월계동중국식20
2노원구월계동경양식20
3노원구월계동일식22
4노원구월계동분식40
5노원구월계동뷔페식5
6노원구월계동정종/대포집/소주방1
7노원구월계동패스트푸드7
8노원구월계동호프/통닭62
9노원구월계동통닭(치킨)17
자치구명법정동명업태명업소수
275노원구중계동건강기능식품수입업4
276노원구중계동영업장판매82
277노원구중계동방문판매27
278노원구중계동전자상거래(통신판매업)107
279노원구중계동<NA>9
280노원구중계동다단계판매7
281노원구중계동기타 건강기능식품일반판매업1
282노원구중계동건강기능식품유통전문판매업1
283노원구신흥동일반조리판매1
284노원구대월면제과점영업1