Overview

Dataset statistics

Number of variables4
Number of observations54
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory35.4 B

Variable types

Categorical1
Text2
Numeric1

Dataset

Description부산광역시중구_다중이용시설대상업소현황_20220727
Author부산광역시 중구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3072202

Reproduction

Analysis started2023-12-10 16:35:56.889297
Analysis finished2023-12-10 16:35:57.417410
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

용도
Categorical

Distinct14
Distinct (%)25.9%
Missing0
Missing (%)0.0%
Memory size564.0 B
실내주차장
11 
대규모점포
PC영업시설
영화상영관
보육시설
Other values (9)
20 

Length

Max length6
Median length5.5
Mean length4.7962963
Min length2

Unique

Unique3 ?
Unique (%)5.6%

Sample

1st rowPC영업시설
2nd rowPC영업시설
3rd rowPC영업시설
4th rowPC영업시설
5th rowPC영업시설

Common Values

ValueCountFrequency (%)
실내주차장 11
20.4%
대규모점포 7
13.0%
PC영업시설 6
11.1%
영화상영관 6
11.1%
보육시설 4
 
7.4%
의료기관 4
 
7.4%
대규모 점포 3
 
5.6%
지하도상가 3
 
5.6%
지하역사 3
 
5.6%
노인요양시설 2
 
3.7%
Other values (4) 5
9.3%

Length

2023-12-11T01:35:57.504396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 11
19.3%
대규모점포 7
12.3%
pc영업시설 6
10.5%
영화상영관 6
10.5%
보육시설 4
 
7.0%
의료기관 4
 
7.0%
대규모 3
 
5.3%
점포 3
 
5.3%
지하도상가 3
 
5.3%
지하역사 3
 
5.3%
Other values (5) 7
12.3%
Distinct53
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size564.0 B
2023-12-11T01:35:57.759948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length7.8703704
Min length4

Characters and Unicode

Total characters425
Distinct characters160
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)96.3%

Sample

1st row자이안트PC
2nd row더락pccafe
3rd row로떼PC카페
4th row리코스타PCCAFE
5th rowZERO LATENCY 서바이벌VR방
ValueCountFrequency (%)
메가박스부산극장(4~8관 2
 
3.3%
롯데시네마 2
 
3.3%
자갈치시장 2
 
3.3%
금강스파 1
 
1.7%
자갈치역 1
 
1.7%
관정빌딩 1
 
1.7%
구대영시네마건물주차장관리aj파크 1
 
1.7%
교보생명빌딩 1
 
1.7%
팬오션(구.stx 1
 
1.7%
무역회관 1
 
1.7%
Other values (47) 47
78.3%
2023-12-11T01:35:58.183701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
3.5%
9
 
2.1%
9
 
2.1%
8
 
1.9%
8
 
1.9%
8
 
1.9%
7
 
1.6%
C 7
 
1.6%
7
 
1.6%
7
 
1.6%
Other values (150) 340
80.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 355
83.5%
Uppercase Letter 38
 
8.9%
Space Separator 6
 
1.4%
Lowercase Letter 6
 
1.4%
Close Punctuation 5
 
1.2%
Open Punctuation 5
 
1.2%
Decimal Number 5
 
1.2%
Math Symbol 2
 
0.5%
Other Punctuation 2
 
0.5%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
4.2%
9
 
2.5%
9
 
2.5%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (115) 270
76.1%
Uppercase Letter
ValueCountFrequency (%)
C 7
18.4%
P 4
 
10.5%
R 3
 
7.9%
A 3
 
7.9%
E 3
 
7.9%
T 2
 
5.3%
Y 2
 
5.3%
V 2
 
5.3%
X 1
 
2.6%
S 1
 
2.6%
Other values (10) 10
26.3%
Lowercase Letter
ValueCountFrequency (%)
c 2
33.3%
p 1
16.7%
a 1
16.7%
f 1
16.7%
e 1
16.7%
Decimal Number
ValueCountFrequency (%)
4 2
40.0%
8 2
40.0%
1 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Math Symbol
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 356
83.8%
Latin 44
 
10.4%
Common 25
 
5.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
4.2%
9
 
2.5%
9
 
2.5%
8
 
2.2%
8
 
2.2%
8
 
2.2%
7
 
2.0%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (116) 271
76.1%
Latin
ValueCountFrequency (%)
C 7
15.9%
P 4
 
9.1%
R 3
 
6.8%
A 3
 
6.8%
E 3
 
6.8%
c 2
 
4.5%
T 2
 
4.5%
Y 2
 
4.5%
V 2
 
4.5%
X 1
 
2.3%
Other values (15) 15
34.1%
Common
ValueCountFrequency (%)
6
24.0%
) 5
20.0%
( 5
20.0%
4 2
 
8.0%
2
 
8.0%
8 2
 
8.0%
. 1
 
4.0%
1 1
 
4.0%
1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 355
83.5%
ASCII 66
 
15.5%
None 4
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
4.2%
9
 
2.5%
9
 
2.5%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (115) 270
76.1%
ASCII
ValueCountFrequency (%)
C 7
 
10.6%
6
 
9.1%
) 5
 
7.6%
( 5
 
7.6%
P 4
 
6.1%
R 3
 
4.5%
A 3
 
4.5%
E 3
 
4.5%
4 2
 
3.0%
c 2
 
3.0%
Other values (22) 26
39.4%
None
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Distinct47
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size564.0 B
2023-12-11T01:35:58.481399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length23.537037
Min length15

Characters and Unicode

Total characters1271
Distinct characters59
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)79.6%

Sample

1st row부산광역시 중구 비프광장로 16 (남포동6가)
2nd row부산광역시 중구 광복로 38, 5층 (창선동2가)
3rd row부산광역시 중구 비프광장로 17, 6층
4th row부산광역시 중구 구덕로48번길 4, 2층
5th row부산광역시 중구 비프광장로 37, 3층 (남포동5가)
ValueCountFrequency (%)
부산광역시 54
20.1%
중구 54
20.1%
중앙대로 11
 
4.1%
비프광장로 9
 
3.4%
남포동5가 7
 
2.6%
지하 6
 
2.2%
중앙동4가 6
 
2.2%
2 5
 
1.9%
중구로 4
 
1.5%
대청동4가 4
 
1.5%
Other values (73) 108
40.3%
2023-12-11T01:35:59.099781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
216
17.0%
85
 
6.7%
68
 
5.4%
63
 
5.0%
57
 
4.5%
55
 
4.3%
54
 
4.2%
54
 
4.2%
52
 
4.1%
44
 
3.5%
Other values (49) 523
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 792
62.3%
Space Separator 216
 
17.0%
Decimal Number 171
 
13.5%
Open Punctuation 42
 
3.3%
Close Punctuation 42
 
3.3%
Other Punctuation 6
 
0.5%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
 
10.7%
68
 
8.6%
63
 
8.0%
57
 
7.2%
55
 
6.9%
54
 
6.8%
54
 
6.8%
52
 
6.6%
44
 
5.6%
38
 
4.8%
Other values (34) 222
28.0%
Decimal Number
ValueCountFrequency (%)
3 26
15.2%
2 26
15.2%
1 26
15.2%
4 23
13.5%
5 19
11.1%
7 15
8.8%
6 15
8.8%
9 11
6.4%
8 6
 
3.5%
0 4
 
2.3%
Space Separator
ValueCountFrequency (%)
216
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 792
62.3%
Common 479
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
 
10.7%
68
 
8.6%
63
 
8.0%
57
 
7.2%
55
 
6.9%
54
 
6.8%
54
 
6.8%
52
 
6.6%
44
 
5.6%
38
 
4.8%
Other values (34) 222
28.0%
Common
ValueCountFrequency (%)
216
45.1%
( 42
 
8.8%
) 42
 
8.8%
3 26
 
5.4%
2 26
 
5.4%
1 26
 
5.4%
4 23
 
4.8%
5 19
 
4.0%
7 15
 
3.1%
6 15
 
3.1%
Other values (5) 29
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 792
62.3%
ASCII 479
37.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
216
45.1%
( 42
 
8.8%
) 42
 
8.8%
3 26
 
5.4%
2 26
 
5.4%
1 26
 
5.4%
4 23
 
4.8%
5 19
 
4.0%
7 15
 
3.1%
6 15
 
3.1%
Other values (5) 29
 
6.1%
Hangul
ValueCountFrequency (%)
85
 
10.7%
68
 
8.6%
63
 
8.0%
57
 
7.2%
55
 
6.9%
54
 
6.8%
54
 
6.8%
52
 
6.6%
44
 
5.6%
38
 
4.8%
Other values (34) 222
28.0%
Distinct53
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9870.4917
Minimum288
Maximum76599
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size618.0 B
2023-12-11T01:35:59.483003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum288
5-th percentile375.36
Q11184.2875
median3655.405
Q39682.8775
95-th percentile44530.717
Maximum76599
Range76311
Interquartile range (IQR)8498.59

Descriptive statistics

Standard deviation15874.902
Coefficient of variation (CV)1.6083193
Kurtosis6.8979499
Mean9870.4917
Median Absolute Deviation (MAD)3094.905
Skewness2.6074851
Sum533006.55
Variance2.5201251 × 108
MonotonicityNot monotonic
2023-12-11T01:35:59.735963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
375.36 2
 
3.7%
31376.0 1
 
1.9%
21641.0 1
 
1.9%
2400.0 1
 
1.9%
3734.0 1
 
1.9%
76599.0 1
 
1.9%
52298.62 1
 
1.9%
2298.0 1
 
1.9%
920.0 1
 
1.9%
1613.16 1
 
1.9%
Other values (43) 43
79.6%
ValueCountFrequency (%)
288.0 1
1.9%
349.66 1
1.9%
375.36 2
3.7%
397.69 1
1.9%
437.0 1
1.9%
475.22 1
1.9%
501.28 1
1.9%
538.0 1
1.9%
583.0 1
1.9%
920.0 1
1.9%
ValueCountFrequency (%)
76599.0 1
1.9%
57533.0 1
1.9%
52298.62 1
1.9%
40348.0 1
1.9%
39639.28 1
1.9%
31376.0 1
1.9%
21641.0 1
1.9%
21359.0 1
1.9%
17747.0 1
1.9%
16198.0 1
1.9%

Interactions

2023-12-11T01:35:57.193037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:35:59.916943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도사업장명도로명주소적용대상규모(제곱미터)
용도1.0001.0000.9700.248
사업장명1.0001.0000.9941.000
도로명주소0.9700.9941.0000.000
적용대상규모(제곱미터)0.2481.0000.0001.000
2023-12-11T01:36:00.135492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적용대상규모(제곱미터)용도
적용대상규모(제곱미터)1.0000.108
용도0.1081.000

Missing values

2023-12-11T01:35:57.304470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:35:57.379985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

용도사업장명도로명주소적용대상규모(제곱미터)
0PC영업시설자이안트PC부산광역시 중구 비프광장로 16 (남포동6가)375.36
1PC영업시설더락pccafe부산광역시 중구 광복로 38, 5층 (창선동2가)349.66
2PC영업시설로떼PC카페부산광역시 중구 비프광장로 17, 6층475.22
3PC영업시설리코스타PCCAFE부산광역시 중구 구덕로48번길 4, 2층931.33
4PC영업시설ZERO LATENCY 서바이벌VR방부산광역시 중구 비프광장로 37, 3층 (남포동5가)501.28
5PC영업시설아이센스리그PC방부산남포점부산광역시 중구 비프광장로 16, 3층 (남포동6가)375.36
6노인요양시설심당요양병원부산광역시 중구 동광길 179 (영주동)2077.95
7노인요양시설송산노인전문요양원부산광역시 중구 충장대로13번길 31 (중앙동4가)1810.0
8대규모 점포롯데백화점 광복점부산광역시 중구 중앙대로 2 (중앙동7가)57533.0
9대규모 점포부산데파트부산광역시 중구 중앙대로 215758.9
용도사업장명도로명주소적용대상규모(제곱미터)
44의료기관신창요양병원부산광역시 중구 중앙대로 55 (중앙동2가)2649.71
45지하도상가남포지하상가(코오롱지하상가)부산광역시 중구 구덕로 지하 4417747.0
46지하도상가광복지하상가(롯데1번가)부산광역시 중구 중앙대로 지하 1716198.0
47지하도상가국제지하상가부산광역시 중구 중구로 지하 313019.0
48지하역사중앙동역부산광역시 중구 중앙대로 지하 937695.0
49지하역사남포동역부산광역시 중구 구덕로 지하 129745.0
50지하역사자갈치역부산광역시 중구 구덕로 지하 808914.0
51목욕장금강스파부산광역시 중구 흑교로31번길 3-1 (부평동3가)1999.34
52목욕장자갈치효소편백원부산광역시 중구 자갈치해안로 52, 3층 (남포동4가)397.69
53학원YBM어학원부산광역시 중구 광복로 95 (광복동1가35-1)1041.33