Overview

Dataset statistics

Number of variables7
Number of observations24
Missing cells20
Missing cells (%)11.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory64.5 B

Variable types

Numeric3
DateTime1
Text2
Categorical1

Dataset

Description부산광역시_동구_PC방현황_20200622
Author부산광역시 동구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3079099

Alerts

연번 is highly overall correlated with 업종명High correlation
우편번호 is highly overall correlated with 업종명High correlation
시설면적 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
업종명 is highly imbalanced (75.0%)Imbalance
연번 has 1 (4.2%) missing valuesMissing
등록(신고)일자 has 1 (4.2%) missing valuesMissing
상호 has 1 (4.2%) missing valuesMissing
우편번호 has 15 (62.5%) missing valuesMissing
영업소소재지(도로명) has 1 (4.2%) missing valuesMissing
시설면적 has 1 (4.2%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:52:11.704457
Analysis finished2023-12-10 16:52:14.123173
Duration2.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct23
Distinct (%)100.0%
Missing1
Missing (%)4.2%
Infinite0
Infinite (%)0.0%
Mean12
Minimum1
Maximum23
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-11T01:52:14.310930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.1
Q16.5
median12
Q317.5
95-th percentile21.9
Maximum23
Range22
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.78233
Coefficient of variation (CV)0.56519417
Kurtosis-1.2
Mean12
Median Absolute Deviation (MAD)6
Skewness0
Sum276
Variance46
MonotonicityStrictly increasing
2023-12-11T01:52:14.511982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
1 1
 
4.2%
2 1
 
4.2%
23 1
 
4.2%
22 1
 
4.2%
21 1
 
4.2%
20 1
 
4.2%
19 1
 
4.2%
18 1
 
4.2%
17 1
 
4.2%
16 1
 
4.2%
Other values (13) 13
54.2%
ValueCountFrequency (%)
1 1
4.2%
2 1
4.2%
3 1
4.2%
4 1
4.2%
5 1
4.2%
6 1
4.2%
7 1
4.2%
8 1
4.2%
9 1
4.2%
10 1
4.2%
ValueCountFrequency (%)
23 1
4.2%
22 1
4.2%
21 1
4.2%
20 1
4.2%
19 1
4.2%
18 1
4.2%
17 1
4.2%
16 1
4.2%
15 1
4.2%
14 1
4.2%

등록(신고)일자
Date

MISSING 

Distinct22
Distinct (%)95.7%
Missing1
Missing (%)4.2%
Memory size324.0 B
Minimum2007-09-19 00:00:00
Maximum2018-06-28 00:00:00
2023-12-11T01:52:14.705226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:14.900295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)

상호
Text

MISSING 

Distinct23
Distinct (%)100.0%
Missing1
Missing (%)4.2%
Memory size324.0 B
2023-12-11T01:52:15.179385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length8.0434783
Min length4

Characters and Unicode

Total characters185
Distinct characters69
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row놀토피시존
2nd row사이블루 PC ZONE 수정점
3rd row아이비스PC방
4th rowOn:A PC방
5th row랜드마크 PC
ValueCountFrequency (%)
pc 9
 
20.0%
pc방 3
 
6.7%
zone 2
 
4.4%
수정점 2
 
4.4%
놀토피시존 1
 
2.2%
루나틱 1
 
2.2%
pc토랑 1
 
2.2%
꿈꾸는 1
 
2.2%
n 1
 
2.2%
cook 1
 
2.2%
Other values (23) 23
51.1%
2023-12-11T01:52:15.694978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
11.9%
C 20
 
10.8%
P 19
 
10.3%
10
 
5.4%
O 6
 
3.2%
6
 
3.2%
5
 
2.7%
4
 
2.2%
3
 
1.6%
3
 
1.6%
Other values (59) 87
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95
51.4%
Uppercase Letter 59
31.9%
Space Separator 22
 
11.9%
Lowercase Letter 6
 
3.2%
Other Punctuation 1
 
0.5%
Open Punctuation 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
10.5%
6
 
6.3%
5
 
5.3%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (41) 53
55.8%
Uppercase Letter
ValueCountFrequency (%)
C 20
33.9%
P 19
32.2%
O 6
 
10.2%
A 3
 
5.1%
X 2
 
3.4%
E 2
 
3.4%
N 2
 
3.4%
Z 2
 
3.4%
M 1
 
1.7%
K 1
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
p 2
33.3%
c 2
33.3%
n 2
33.3%
Space Separator
ValueCountFrequency (%)
22
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95
51.4%
Latin 65
35.1%
Common 25
 
13.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
10.5%
6
 
6.3%
5
 
5.3%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (41) 53
55.8%
Latin
ValueCountFrequency (%)
C 20
30.8%
P 19
29.2%
O 6
 
9.2%
A 3
 
4.6%
p 2
 
3.1%
X 2
 
3.1%
c 2
 
3.1%
n 2
 
3.1%
E 2
 
3.1%
N 2
 
3.1%
Other values (4) 5
 
7.7%
Common
ValueCountFrequency (%)
22
88.0%
: 1
 
4.0%
( 1
 
4.0%
) 1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95
51.4%
ASCII 90
48.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22
24.4%
C 20
22.2%
P 19
21.1%
O 6
 
6.7%
A 3
 
3.3%
p 2
 
2.2%
X 2
 
2.2%
c 2
 
2.2%
n 2
 
2.2%
E 2
 
2.2%
Other values (8) 10
11.1%
Hangul
ValueCountFrequency (%)
10
 
10.5%
6
 
6.3%
5
 
5.3%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (41) 53
55.8%

우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct8
Distinct (%)88.9%
Missing15
Missing (%)62.5%
Infinite0
Infinite (%)0.0%
Mean48774.778
Minimum48726
Maximum48816
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-11T01:52:15.890235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum48726
5-th percentile48732
Q148744
median48781
Q348815
95-th percentile48816
Maximum48816
Range90
Interquartile range (IQR)71

Descriptive statistics

Standard deviation36.272503
Coefficient of variation (CV)0.00074367335
Kurtosis-1.9454979
Mean48774.778
Median Absolute Deviation (MAD)35
Skewness0.00046855526
Sum438973
Variance1315.6944
MonotonicityNot monotonic
2023-12-11T01:52:16.021378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
48816 2
 
8.3%
48781 1
 
4.2%
48744 1
 
4.2%
48788 1
 
4.2%
48741 1
 
4.2%
48815 1
 
4.2%
48746 1
 
4.2%
48726 1
 
4.2%
(Missing) 15
62.5%
ValueCountFrequency (%)
48726 1
4.2%
48741 1
4.2%
48744 1
4.2%
48746 1
4.2%
48781 1
4.2%
48788 1
4.2%
48815 1
4.2%
48816 2
8.3%
ValueCountFrequency (%)
48816 2
8.3%
48815 1
4.2%
48788 1
4.2%
48781 1
4.2%
48746 1
4.2%
48744 1
4.2%
48741 1
4.2%
48726 1
4.2%
Distinct23
Distinct (%)100.0%
Missing1
Missing (%)4.2%
Memory size324.0 B
2023-12-11T01:52:16.264461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length26.521739
Min length22

Characters and Unicode

Total characters610
Distinct characters59
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row부산광역시 동구 초량로 9-1 (초량동)
2nd row부산광역시 동구 고관로 102 (수정동,동환빌딩2층)
3rd row부산광역시 동구 초량상로 84-1 (초량동)
4th row부산광역시 동구 초량중로 101 (초량동)
5th row부산광역시 동구 범일로102번길 13 (범일동)
ValueCountFrequency (%)
부산광역시 23
18.5%
동구 23
18.5%
초량동 8
 
6.5%
수정동 5
 
4.0%
범일동 5
 
4.0%
중앙대로 4
 
3.2%
초량중로 3
 
2.4%
3층 3
 
2.4%
범일로102번길 3
 
2.4%
14 3
 
2.4%
Other values (38) 44
35.5%
2023-12-11T01:52:16.708717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
107
17.5%
48
 
7.9%
1 26
 
4.3%
24
 
3.9%
24
 
3.9%
23
 
3.8%
23
 
3.8%
23
 
3.8%
23
 
3.8%
) 23
 
3.8%
Other values (49) 266
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 359
58.9%
Space Separator 107
 
17.5%
Decimal Number 84
 
13.8%
Close Punctuation 23
 
3.8%
Open Punctuation 23
 
3.8%
Other Punctuation 11
 
1.8%
Dash Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
13.4%
24
 
6.7%
24
 
6.7%
23
 
6.4%
23
 
6.4%
23
 
6.4%
23
 
6.4%
23
 
6.4%
17
 
4.7%
17
 
4.7%
Other values (35) 114
31.8%
Decimal Number
ValueCountFrequency (%)
1 26
31.0%
2 11
13.1%
0 10
 
11.9%
3 9
 
10.7%
4 8
 
9.5%
9 7
 
8.3%
5 5
 
6.0%
8 4
 
4.8%
7 4
 
4.8%
Space Separator
ValueCountFrequency (%)
107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 359
58.9%
Common 251
41.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
13.4%
24
 
6.7%
24
 
6.7%
23
 
6.4%
23
 
6.4%
23
 
6.4%
23
 
6.4%
23
 
6.4%
17
 
4.7%
17
 
4.7%
Other values (35) 114
31.8%
Common
ValueCountFrequency (%)
107
42.6%
1 26
 
10.4%
) 23
 
9.2%
( 23
 
9.2%
, 11
 
4.4%
2 11
 
4.4%
0 10
 
4.0%
3 9
 
3.6%
4 8
 
3.2%
9 7
 
2.8%
Other values (4) 16
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 359
58.9%
ASCII 251
41.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
107
42.6%
1 26
 
10.4%
) 23
 
9.2%
( 23
 
9.2%
, 11
 
4.4%
2 11
 
4.4%
0 10
 
4.0%
3 9
 
3.6%
4 8
 
3.2%
9 7
 
2.8%
Other values (4) 16
 
6.4%
Hangul
ValueCountFrequency (%)
48
13.4%
24
 
6.7%
24
 
6.7%
23
 
6.4%
23
 
6.4%
23
 
6.4%
23
 
6.4%
23
 
6.4%
17
 
4.7%
17
 
4.7%
Other values (35) 114
31.8%

시설면적
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct23
Distinct (%)100.0%
Missing1
Missing (%)4.2%
Infinite0
Infinite (%)0.0%
Mean218.41739
Minimum61.49
Maximum409.01
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-11T01:52:16.869874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum61.49
5-th percentile99.94
Q1154.775
median216.86
Q3271.135
95-th percentile392.84
Maximum409.01
Range347.52
Interquartile range (IQR)116.36

Descriptive statistics

Standard deviation90.113475
Coefficient of variation (CV)0.41257463
Kurtosis0.014599258
Mean218.41739
Median Absolute Deviation (MAD)61.12
Skewness0.48415178
Sum5023.6
Variance8120.4383
MonotonicityNot monotonic
2023-12-11T01:52:17.051664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
97.92 1
 
4.2%
243.95 1
 
4.2%
272.09 1
 
4.2%
61.49 1
 
4.2%
233.0 1
 
4.2%
155.74 1
 
4.2%
173.68 1
 
4.2%
409.01 1
 
4.2%
396.99 1
 
4.2%
270.18 1
 
4.2%
Other values (13) 13
54.2%
ValueCountFrequency (%)
61.49 1
4.2%
97.92 1
4.2%
118.12 1
4.2%
120.44 1
4.2%
133.31 1
4.2%
153.81 1
4.2%
155.74 1
4.2%
173.68 1
4.2%
187.78 1
4.2%
200.12 1
4.2%
ValueCountFrequency (%)
409.01 1
4.2%
396.99 1
4.2%
355.49 1
4.2%
278.28 1
4.2%
278.27 1
4.2%
272.09 1
4.2%
270.18 1
4.2%
243.95 1
4.2%
234.15 1
4.2%
233.0 1
4.2%

업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
인터넷컴퓨터게임시설제공업
23 
<NA>
 
1

Length

Max length13
Median length13
Mean length12.625
Min length4

Unique

Unique1 ?
Unique (%)4.2%

Sample

1st row인터넷컴퓨터게임시설제공업
2nd row인터넷컴퓨터게임시설제공업
3rd row인터넷컴퓨터게임시설제공업
4th row인터넷컴퓨터게임시설제공업
5th row인터넷컴퓨터게임시설제공업

Common Values

ValueCountFrequency (%)
인터넷컴퓨터게임시설제공업 23
95.8%
<NA> 1
 
4.2%

Length

2023-12-11T01:52:17.272145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:52:17.453786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인터넷컴퓨터게임시설제공업 23
95.8%
na 1
 
4.2%

Interactions

2023-12-11T01:52:13.104592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:12.159451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:12.697930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:13.236076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:12.348744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:12.837129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:13.351498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:12.538863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:52:12.960966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:52:17.622917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록(신고)일자상호우편번호영업소소재지(도로명)시설면적
연번1.0000.8911.0000.0001.0000.558
등록(신고)일자0.8911.0001.0001.0001.0000.963
상호1.0001.0001.0001.0001.0001.000
우편번호0.0001.0001.0001.0001.0000.486
영업소소재지(도로명)1.0001.0001.0001.0001.0001.000
시설면적0.5580.9631.0000.4861.0001.000
2023-12-11T01:52:17.801267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호시설면적업종명
연번1.000-0.2930.1411.000
우편번호-0.2931.000-0.0421.000
시설면적0.141-0.0421.0001.000
업종명1.0001.0001.0001.000

Missing values

2023-12-11T01:52:13.546918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:52:13.770976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:52:13.956647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번등록(신고)일자상호우편번호영업소소재지(도로명)시설면적업종명
012007-09-19놀토피시존<NA>부산광역시 동구 초량로 9-1 (초량동)97.92인터넷컴퓨터게임시설제공업
122007-12-05사이블루 PC ZONE 수정점<NA>부산광역시 동구 고관로 102 (수정동,동환빌딩2층)243.95인터넷컴퓨터게임시설제공업
232008-05-20아이비스PC방<NA>부산광역시 동구 초량상로 84-1 (초량동)120.44인터넷컴퓨터게임시설제공업
342008-05-23On:A PC방<NA>부산광역시 동구 초량중로 101 (초량동)202.8인터넷컴퓨터게임시설제공업
452008-05-26랜드마크 PC<NA>부산광역시 동구 범일로102번길 13 (범일동)278.28인터넷컴퓨터게임시설제공업
562008-05-26라이또 PC방<NA>부산광역시 동구 초량상로 94 (초량동)234.15인터넷컴퓨터게임시설제공업
672008-07-25사이버포유pc<NA>부산광역시 동구 초량상로 90-1 (초량동,부산장우신협 3층)118.12인터넷컴퓨터게임시설제공업
782008-07-30드레곤PC방<NA>부산광역시 동구 중앙대로371번길 9 (수정동)230.12인터넷컴퓨터게임시설제공업
892008-08-01에이아이(AI)PC까페<NA>부산광역시 동구 초량중로 95 (초량동)355.49인터넷컴퓨터게임시설제공업
9102008-09-26피플pc<NA>부산광역시 동구 망양로 851 (수정동)133.31인터넷컴퓨터게임시설제공업
연번등록(신고)일자상호우편번호영업소소재지(도로명)시설면적업종명
14152011-10-25MAX PC ZONE<NA>부산광역시 동구 중앙대로 201 (초량동,행운빌딩 지하1층)216.86인터넷컴퓨터게임시설제공업
15162013-05-01라이온PC48816부산광역시 동구 중앙대로 197 (초량동)270.18인터넷컴퓨터게임시설제공업
16172015-07-17PC토랑48744부산광역시 동구 범일로102번길 14 (범일동)396.99인터넷컴퓨터게임시설제공업
17182015-12-09꿈꾸는 PC n COOK48788부산광역시 동구 중앙대로371번길 12, 4층 (수정동)409.01인터넷컴퓨터게임시설제공업
18192016-06-13수상한 PC48741부산광역시 동구 조방로 14 (범일동, 동일타워)173.68인터넷컴퓨터게임시설제공업
19202016-12-27루나틱 하이 PC48815부산광역시 동구 초량중로 78, 2층 (초량동)155.74인터넷컴퓨터게임시설제공업
20212017-10-26플러스피시방 본점48816부산광역시 동구 대영로243번길 89 (초량동)233.0인터넷컴퓨터게임시설제공업
21222018-05-21스포플레이 PC48746부산광역시 동구 자성공원로 20, 1층 (범일동)61.49인터넷컴퓨터게임시설제공업
22232018-06-28OX PC 범일동점48726부산광역시 동구 중앙대로 540, 2층 (범일동)272.09인터넷컴퓨터게임시설제공업
23<NA><NA><NA><NA><NA><NA><NA>