Overview

Dataset statistics

Number of variables7
Number of observations58
Missing cells47
Missing cells (%)11.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory61.3 B

Variable types

Numeric3
Text1
Categorical2
DateTime1

Dataset

Description부산광역시_여성회관창업지원센터현황_20230920
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3044615

Alerts

연번 is highly overall correlated with 면적_제곱미터High correlation
면적_제곱미터 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
종사자수_대표자포함 is highly overall correlated with 면적_제곱미터High correlation
소재지 또는 홈페이지 주소 is highly overall correlated with 면적_제곱미터High correlation
면적_제곱미터 has 47 (81.0%) missing valuesMissing
연번 has unique valuesUnique
업체명_대표자 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:43:54.284948
Analysis finished2023-12-10 16:43:56.225436
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct58
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.5
Minimum1
Maximum58
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size654.0 B
2023-12-11T01:43:56.337571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.85
Q115.25
median29.5
Q343.75
95-th percentile55.15
Maximum58
Range57
Interquartile range (IQR)28.5

Descriptive statistics

Standard deviation16.886879
Coefficient of variation (CV)0.57243656
Kurtosis-1.2
Mean29.5
Median Absolute Deviation (MAD)14.5
Skewness0
Sum1711
Variance285.16667
MonotonicityStrictly increasing
2023-12-11T01:43:56.563118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.7%
45 1
 
1.7%
33 1
 
1.7%
34 1
 
1.7%
35 1
 
1.7%
36 1
 
1.7%
37 1
 
1.7%
38 1
 
1.7%
39 1
 
1.7%
40 1
 
1.7%
Other values (48) 48
82.8%
ValueCountFrequency (%)
1 1
1.7%
2 1
1.7%
3 1
1.7%
4 1
1.7%
5 1
1.7%
6 1
1.7%
7 1
1.7%
8 1
1.7%
9 1
1.7%
10 1
1.7%
ValueCountFrequency (%)
58 1
1.7%
57 1
1.7%
56 1
1.7%
55 1
1.7%
54 1
1.7%
53 1
1.7%
52 1
1.7%
51 1
1.7%
50 1
1.7%
49 1
1.7%

업체명_대표자
Text

UNIQUE 

Distinct58
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size596.0 B
2023-12-11T01:43:56.925722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length9.4482759
Min length2

Characters and Unicode

Total characters548
Distinct characters217
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row잼아트(강*경)
2nd row늘푸른헤어(방*윤)
3rd row도란도란 노인복지센터(이*선)
4th row카페파스텔(김*옥)
5th row샤이독코리아(박*수)
ValueCountFrequency (%)
스터디카페 2
 
2.6%
큐(q)사랑 2
 
2.6%
잼아트(강*경 1
 
1.3%
일품하누진 1
 
1.3%
올히악세서리 1
 
1.3%
한국본부 1
 
1.3%
명상치유 1
 
1.3%
싱잉볼 1
 
1.3%
히말라얀 1
 
1.3%
미크베이커리(meek)(조*은 1
 
1.3%
Other values (65) 65
84.4%
2023-12-11T01:43:57.502278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 43
 
7.8%
) 43
 
7.8%
* 38
 
6.9%
19
 
3.5%
13
 
2.4%
10
 
1.8%
8
 
1.5%
8
 
1.5%
7
 
1.3%
7
 
1.3%
Other values (207) 352
64.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 379
69.2%
Open Punctuation 43
 
7.8%
Close Punctuation 43
 
7.8%
Other Punctuation 39
 
7.1%
Space Separator 19
 
3.5%
Uppercase Letter 12
 
2.2%
Lowercase Letter 11
 
2.0%
Decimal Number 1
 
0.2%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
3.4%
10
 
2.6%
8
 
2.1%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
6
 
1.6%
6
 
1.6%
5
 
1.3%
Other values (183) 302
79.7%
Uppercase Letter
ValueCountFrequency (%)
Q 2
16.7%
M 2
16.7%
N 2
16.7%
E 1
8.3%
I 1
8.3%
D 1
8.3%
V 1
8.3%
O 1
8.3%
U 1
8.3%
Lowercase Letter
ValueCountFrequency (%)
e 2
18.2%
o 2
18.2%
l 2
18.2%
k 1
9.1%
i 1
9.1%
m 1
9.1%
d 1
9.1%
g 1
9.1%
Other Punctuation
ValueCountFrequency (%)
* 38
97.4%
, 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Space Separator
ValueCountFrequency (%)
19
100.0%
Decimal Number
ValueCountFrequency (%)
9 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 380
69.3%
Common 145
 
26.5%
Latin 23
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
3.4%
10
 
2.6%
8
 
2.1%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
6
 
1.6%
6
 
1.6%
5
 
1.3%
Other values (184) 303
79.7%
Latin
ValueCountFrequency (%)
Q 2
 
8.7%
M 2
 
8.7%
e 2
 
8.7%
o 2
 
8.7%
l 2
 
8.7%
N 2
 
8.7%
E 1
 
4.3%
I 1
 
4.3%
D 1
 
4.3%
k 1
 
4.3%
Other values (7) 7
30.4%
Common
ValueCountFrequency (%)
( 43
29.7%
) 43
29.7%
* 38
26.2%
19
13.1%
9 1
 
0.7%
, 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 379
69.2%
ASCII 168
30.7%
None 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 43
25.6%
) 43
25.6%
* 38
22.6%
19
11.3%
Q 2
 
1.2%
M 2
 
1.2%
e 2
 
1.2%
o 2
 
1.2%
l 2
 
1.2%
N 2
 
1.2%
Other values (13) 13
 
7.7%
Hangul
ValueCountFrequency (%)
13
 
3.4%
10
 
2.6%
8
 
2.1%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
6
 
1.6%
6
 
1.6%
5
 
1.3%
Other values (183) 302
79.7%
None
ValueCountFrequency (%)
1
100.0%

소재지 또는 홈페이지 주소
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)36.2%
Missing0
Missing (%)0.0%
Memory size596.0 B
부산광역시 남구
11 
부산광역시 수영구
부산광역시 부산진구
부산광역시 해운대구
부산광역시 연제구
Other values (16)
22 

Length

Max length43
Median length41
Mean length11.637931
Min length8

Unique

Unique12 ?
Unique (%)20.7%

Sample

1st row부산광역시 기장군
2nd row부산광역시 수영구
3rd row부산광역시 남구
4th row부산광역시 사상구
5th row부산광역시 남구

Common Values

ValueCountFrequency (%)
부산광역시 남구 11
19.0%
부산광역시 수영구 8
13.8%
부산광역시 부산진구 7
12.1%
부산광역시 해운대구 6
10.3%
부산광역시 연제구 4
 
6.9%
부산광역시 기장군 3
 
5.2%
부산광역시 동구 3
 
5.2%
부산광역시 사상구 2
 
3.4%
부산광역시 금정구 2
 
3.4%
www.kkamukpig.com 1
 
1.7%
Other values (11) 11
19.0%

Length

2023-12-11T01:43:57.717078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역시 51
46.4%
남구 11
 
10.0%
수영구 8
 
7.3%
부산진구 7
 
6.4%
해운대구 6
 
5.5%
연제구 4
 
3.6%
기장군 3
 
2.7%
동구 3
 
2.7%
사상구 2
 
1.8%
금정구 2
 
1.8%
Other values (13) 13
 
11.8%

사업내용
Categorical

Distinct17
Distinct (%)29.3%
Missing0
Missing (%)0.0%
Memory size596.0 B
소매업
10 
서비스
음식점
도소매업
음식
Other values (12)
22 

Length

Max length9
Median length3
Mean length3.5
Min length2

Unique

Unique9 ?
Unique (%)15.5%

Sample

1st row음식점
2nd row서비스
3rd row서비스
4th row카페
5th row소매업

Common Values

ValueCountFrequency (%)
소매업 10
17.2%
서비스 8
13.8%
음식점 7
12.1%
도소매업 6
10.3%
음식 5
8.6%
휴게음식점 5
8.6%
카페 4
 
6.9%
미용업 4
 
6.9%
서비스,소매 1
 
1.7%
제조소매 1
 
1.7%
Other values (7) 7
12.1%

Length

2023-12-11T01:43:58.186935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
소매업 11
16.7%
서비스 9
13.6%
음식점 7
10.6%
도소매업 6
9.1%
음식 5
7.6%
휴게음식점 5
7.6%
카페 4
 
6.1%
미용업 4
 
6.1%
4
 
6.1%
소매 3
 
4.5%
Other values (7) 8
12.1%

면적_제곱미터
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct8
Distinct (%)72.7%
Missing47
Missing (%)81.0%
Infinite0
Infinite (%)0.0%
Mean66.230909
Minimum26
Maximum198
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size654.0 B
2023-12-11T01:43:58.495282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum26
5-th percentile26.22
Q133
median33.05
Q366
95-th percentile181.5
Maximum198
Range172
Interquartile range (IQR)33

Descriptive statistics

Standard deviation59.184299
Coefficient of variation (CV)0.89360541
Kurtosis1.9106582
Mean66.230909
Median Absolute Deviation (MAD)7.05
Skewness1.7520393
Sum728.54
Variance3502.7812
MonotonicityNot monotonic
2023-12-11T01:43:58.759629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
33.0 2
 
3.4%
66.0 2
 
3.4%
33.05 2
 
3.4%
198.0 1
 
1.7%
165.0 1
 
1.7%
49.0 1
 
1.7%
26.0 1
 
1.7%
26.44 1
 
1.7%
(Missing) 47
81.0%
ValueCountFrequency (%)
26.0 1
1.7%
26.44 1
1.7%
33.0 2
3.4%
33.05 2
3.4%
49.0 1
1.7%
66.0 2
3.4%
165.0 1
1.7%
198.0 1
1.7%
ValueCountFrequency (%)
198.0 1
1.7%
165.0 1
1.7%
66.0 2
3.4%
49.0 1
1.7%
33.05 2
3.4%
33.0 2
3.4%
26.44 1
1.7%
26.0 1
1.7%

종사자수_대표자포함
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5517241
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size654.0 B
2023-12-11T01:43:59.051598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile3.15
Maximum16
Range15
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.0704471
Coefficient of variation (CV)1.3342882
Kurtosis43.256407
Mean1.5517241
Median Absolute Deviation (MAD)0
Skewness6.2806335
Sum90
Variance4.2867514
MonotonicityNot monotonic
2023-12-11T01:43:59.246698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 46
79.3%
2 8
 
13.8%
16 1
 
1.7%
3 1
 
1.7%
5 1
 
1.7%
4 1
 
1.7%
ValueCountFrequency (%)
1 46
79.3%
2 8
 
13.8%
3 1
 
1.7%
4 1
 
1.7%
5 1
 
1.7%
16 1
 
1.7%
ValueCountFrequency (%)
16 1
 
1.7%
5 1
 
1.7%
4 1
 
1.7%
3 1
 
1.7%
2 8
 
13.8%
1 46
79.3%
Distinct50
Distinct (%)86.2%
Missing0
Missing (%)0.0%
Memory size596.0 B
Minimum2019-01-28 00:00:00
Maximum2023-07-14 00:00:00
2023-12-11T01:43:59.455160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:59.677345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-11T01:43:55.546227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:54.752128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:55.113428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:55.678204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:54.845473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:55.253301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:55.821303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:54.985946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:55.400451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:43:59.845206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명_대표자소재지 또는 홈페이지 주소사업내용면적_제곱미터종사자수_대표자포함창업일자
연번1.0001.0000.4080.6240.0000.0000.961
업체명_대표자1.0001.0001.0001.0001.0001.0001.000
소재지 또는 홈페이지 주소0.4081.0001.0000.6181.0000.0000.939
사업내용0.6241.0000.6181.0000.0000.4290.892
면적_제곱미터0.0001.0001.0000.0001.0000.8580.837
종사자수_대표자포함0.0001.0000.0000.4290.8581.0000.886
창업일자0.9611.0000.9390.8920.8370.8861.000
2023-12-11T01:44:00.013005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업내용소재지 또는 홈페이지 주소
사업내용1.0000.204
소재지 또는 홈페이지 주소0.2041.000
2023-12-11T01:44:00.162693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번면적_제곱미터종사자수_대표자포함소재지 또는 홈페이지 주소사업내용
연번1.000-0.554-0.3030.1160.268
면적_제곱미터-0.5541.0000.6730.7070.000
종사자수_대표자포함-0.3030.6731.0000.0000.208
소재지 또는 홈페이지 주소0.1160.7070.0001.0000.204
사업내용0.2680.0000.2080.2041.000

Missing values

2023-12-11T01:43:56.003265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:43:56.148921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명_대표자소재지 또는 홈페이지 주소사업내용면적_제곱미터종사자수_대표자포함창업일자
01잼아트(강*경)부산광역시 기장군음식점198.022019-01-28
12늘푸른헤어(방*윤)부산광역시 수영구서비스33.012019-02-28
23도란도란 노인복지센터(이*선)부산광역시 남구서비스66.0162019-04-02
34카페파스텔(김*옥)부산광역시 사상구카페165.032019-03-21
45샤이독코리아(박*수)부산광역시 남구소매업<NA>12019-06-20
56앨리스 펫(박*영)부산광역시 해운대구소매업33.012019-06-26
67밀양카페프렌즈(김*순)경상남도 밀양시카페66.012019-05-03
78쩐주단(최*선)부산광역시 부산진구카페49.052019-06-26
89윤미쌤의작은빵집(정*미)부산광역시 해운대구카페26.012019-06-03
910제이엘라운드(조*희)부산광역시 동구제조소매33.0512019-08-26
연번업체명_대표자소재지 또는 홈페이지 주소사업내용면적_제곱미터종사자수_대표자포함창업일자
4849달콤부엌부산광역시 해운대구휴게음식점<NA>12022-07-24
4950궁물부산광역시 사상구도소매업<NA>12022-04-25
5051윤슬한지공예연구실부산광역시 기장군도소매업<NA>12022-10-10
5152제전농원부산광역시 기장군휴게음식점<NA>12022-12-01
5253바루나 메이크업부산광역시 남구미용업<NA>12023-05-19
5354(카페)포레드샤부산광역시 연제구휴게음식점<NA>12023-05-24
5455OVENMIND(오븐마인드)부산광역시 남구휴게음식점<NA>12023-06-14
5556더쿠9부산광역시 남구소매업<NA>12023-04-01
5657good샵부산광역시 연제구미용업<NA>12023-06-18
5758하이고래부산광역시 수영구음식점<NA>12023-07-14