Overview

Dataset statistics

Number of variables6
Number of observations283
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.7 KiB
Average record size in memory49.5 B

Variable types

Numeric1
Categorical4
Text1

Dataset

Description부산광역시_금정구_건설기계사업자현황_20220210
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025829

Alerts

상태 has constant value ""Constant
주소 is highly overall correlated with 사업유형 and 1 other fieldsHigh correlation
사업유형 is highly overall correlated with 등록종별 and 1 other fieldsHigh correlation
등록종별 is highly overall correlated with 사업유형 and 1 other fieldsHigh correlation
사업유형 is highly imbalanced (91.1%)Imbalance
등록종별 is highly imbalanced (76.0%)Imbalance
주소 is highly imbalanced (81.0%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:16:38.208411
Analysis finished2023-12-10 17:16:40.109756
Duration1.9 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct283
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean142
Minimum1
Maximum283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-11T02:16:40.355098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.1
Q171.5
median142
Q3212.5
95-th percentile268.9
Maximum283
Range282
Interquartile range (IQR)141

Descriptive statistics

Standard deviation81.839273
Coefficient of variation (CV)0.57633291
Kurtosis-1.2
Mean142
Median Absolute Deviation (MAD)71
Skewness0
Sum40186
Variance6697.6667
MonotonicityStrictly increasing
2023-12-11T02:16:40.747696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
188 1
 
0.4%
194 1
 
0.4%
193 1
 
0.4%
192 1
 
0.4%
191 1
 
0.4%
190 1
 
0.4%
189 1
 
0.4%
187 1
 
0.4%
196 1
 
0.4%
Other values (273) 273
96.5%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
283 1
0.4%
282 1
0.4%
281 1
0.4%
280 1
0.4%
279 1
0.4%
278 1
0.4%
277 1
0.4%
276 1
0.4%
275 1
0.4%
274 1
0.4%

상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
영업
283 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 283
100.0%

Length

2023-12-11T02:16:41.149432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:41.431374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 283
100.0%
Distinct282
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-11T02:16:42.215773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length6
Mean length6.5088339
Min length4

Characters and Unicode

Total characters1842
Distinct characters141
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique281 ?
Unique (%)99.3%

Sample

1st row(주)제일종합중기
2nd row경동제10호(김상오)
3rd row경동제12호(하천수)
4th row경동제14호(차무준)
5th row경동제15호(김원태)
ValueCountFrequency (%)
세기tower 2
 
0.7%
쌍용제14호 1
 
0.3%
배주원 1
 
0.3%
경동제1호 1
 
0.3%
부일제11호 1
 
0.3%
부일제15호 1
 
0.3%
성영제4호 1
 
0.3%
항도제1호 1
 
0.3%
개별제67호 1
 
0.3%
개별제16호 1
 
0.3%
Other values (280) 280
96.2%
2023-12-11T02:16:43.543358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
268
 
14.5%
264
 
14.3%
1 94
 
5.1%
2 78
 
4.2%
3 63
 
3.4%
57
 
3.1%
57
 
3.1%
55
 
3.0%
55
 
3.0%
4 51
 
2.8%
Other values (131) 800
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1292
70.1%
Decimal Number 452
 
24.5%
Open Punctuation 39
 
2.1%
Close Punctuation 39
 
2.1%
Uppercase Letter 12
 
0.7%
Space Separator 8
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
268
20.7%
264
20.4%
57
 
4.4%
57
 
4.4%
55
 
4.3%
55
 
4.3%
48
 
3.7%
38
 
2.9%
38
 
2.9%
35
 
2.7%
Other values (111) 377
29.2%
Decimal Number
ValueCountFrequency (%)
1 94
20.8%
2 78
17.3%
3 63
13.9%
4 51
11.3%
5 41
9.1%
7 35
 
7.7%
6 27
 
6.0%
8 22
 
4.9%
9 22
 
4.9%
0 19
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
O 2
16.7%
W 2
16.7%
E 2
16.7%
R 2
16.7%
T 2
16.7%
C 1
8.3%
H 1
8.3%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1292
70.1%
Common 538
29.2%
Latin 12
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
268
20.7%
264
20.4%
57
 
4.4%
57
 
4.4%
55
 
4.3%
55
 
4.3%
48
 
3.7%
38
 
2.9%
38
 
2.9%
35
 
2.7%
Other values (111) 377
29.2%
Common
ValueCountFrequency (%)
1 94
17.5%
2 78
14.5%
3 63
11.7%
4 51
9.5%
5 41
7.6%
( 39
7.2%
) 39
7.2%
7 35
 
6.5%
6 27
 
5.0%
8 22
 
4.1%
Other values (3) 49
9.1%
Latin
ValueCountFrequency (%)
O 2
16.7%
W 2
16.7%
E 2
16.7%
R 2
16.7%
T 2
16.7%
C 1
8.3%
H 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1292
70.1%
ASCII 550
29.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
268
20.7%
264
20.4%
57
 
4.4%
57
 
4.4%
55
 
4.3%
55
 
4.3%
48
 
3.7%
38
 
2.9%
38
 
2.9%
35
 
2.7%
Other values (111) 377
29.2%
ASCII
ValueCountFrequency (%)
1 94
17.1%
2 78
14.2%
3 63
11.5%
4 51
9.3%
5 41
7.5%
( 39
7.1%
) 39
7.1%
7 35
 
6.4%
6 27
 
4.9%
8 22
 
4.0%
Other values (10) 61
11.1%

사업유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
대여업
278 
매매업
 
4
해체재활용업
 
1

Length

Max length6
Median length3
Mean length3.0106007
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row대여업
2nd row대여업
3rd row대여업
4th row대여업
5th row대여업

Common Values

ValueCountFrequency (%)
대여업 278
98.2%
매매업 4
 
1.4%
해체재활용업 1
 
0.4%

Length

2023-12-11T02:16:43.985069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:44.302729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대여업 278
98.2%
매매업 4
 
1.4%
해체재활용업 1
 
0.4%

등록종별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
개별
266 
일반
 
12
<NA>
 
5

Length

Max length4
Median length2
Mean length2.0353357
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row개별
3rd row개별
4th row개별
5th row개별

Common Values

ValueCountFrequency (%)
개별 266
94.0%
일반 12
 
4.2%
<NA> 5
 
1.8%

Length

2023-12-11T02:16:44.799189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:45.168696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개별 266
94.0%
일반 12
 
4.2%
na 5
 
1.8%

주소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct23
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
부산광역시 금정구 중앙대로 2027, 501호(남산동)
256 
부산광역시 금정구 중앙대로 2027(남산동)
 
5
부산광역시 금정구 명서로 76, 101호(서동, 삼한아파트상가)
 
2
부산광역시 금정구 명서로 94, 부산성지아파트 101동 312호
 
1
부산광역시 금정구 중앙대로 1799, 구서동유림노르웨이아침 333호
 
1
Other values (18)
 
18

Length

Max length44
Median length30
Mean length30.014134
Min length21

Unique

Unique20 ?
Unique (%)7.1%

Sample

1st row부산광역시 금정구 금정로 249(구서동)
2nd row부산광역시 금정구 중앙대로 2027, 501호(남산동)
3rd row부산광역시 금정구 중앙대로 2027, 501호(남산동)
4th row부산광역시 금정구 중앙대로 2027, 501호(남산동)
5th row부산광역시 금정구 중앙대로 2027, 501호(남산동)

Common Values

ValueCountFrequency (%)
부산광역시 금정구 중앙대로 2027, 501호(남산동) 256
90.5%
부산광역시 금정구 중앙대로 2027(남산동) 5
 
1.8%
부산광역시 금정구 명서로 76, 101호(서동, 삼한아파트상가) 2
 
0.7%
부산광역시 금정구 명서로 94, 부산성지아파트 101동 312호 1
 
0.4%
부산광역시 금정구 중앙대로 1799, 구서동유림노르웨이아침 333호 1
 
0.4%
부산광역시 금정구 노포사송로 143, 1층(노포동) 1
 
0.4%
부산광역시 금정구 중앙대로 2389(두구동) 1
 
0.4%
부산광역시 금정구 중앙대로 1799, 유림오피스텔 306호 1
 
0.4%
부산광역시 금정구 부곡로 140, 406호(부곡동) 1
 
0.4%
부산광역시 금정구 시실로 13(부곡동) 1
 
0.4%
Other values (13) 13
 
4.6%

Length

2023-12-11T02:16:45.569206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역시 283
19.9%
금정구 283
19.9%
중앙대로 265
18.6%
2027 256
18.0%
501호(남산동 256
18.0%
2027(남산동 5
 
0.4%
명서로 3
 
0.2%
1799 3
 
0.2%
금샘로 2
 
0.1%
333호 2
 
0.1%
Other values (60) 64
 
4.5%

Interactions

2023-12-11T02:16:39.287315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:16:45.877928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업유형등록종별주소
순번1.0000.0950.1290.155
사업유형0.0951.000NaN0.986
등록종별0.129NaN1.0000.932
주소0.1550.9860.9321.000
2023-12-11T02:16:46.272832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주소사업유형등록종별
주소1.0000.9300.875
사업유형0.9301.0001.000
등록종별0.8751.0001.000
2023-12-11T02:16:46.559072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업유형등록종별주소
순번1.0000.0540.0970.054
사업유형0.0541.0001.0000.930
등록종별0.0971.0001.0000.875
주소0.0540.9300.8751.000

Missing values

2023-12-11T02:16:39.646119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:16:40.002373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상태상호명칭사업유형등록종별주소
01영업(주)제일종합중기대여업일반부산광역시 금정구 금정로 249(구서동)
12영업경동제10호(김상오)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
23영업경동제12호(하천수)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
34영업경동제14호(차무준)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
45영업경동제15호(김원태)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
56영업경동제21호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
67영업경동제22호(김진도)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
78영업경동제25호(신흥석)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
89영업경동제26호(배영환)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
910영업경동제41호(문수환)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
순번상태상호명칭사업유형등록종별주소
273274영업쌍용제21호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
274275영업개별제5호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
275276영업(주)디알산업개발대여업개별부산광역시 금정구 중앙대로1805번길 6, 1205호(구서동, 삼백크라시앙)
276277영업개별제2호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
277278영업개별제9호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
278279영업대양제11호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
279280영업개별제31호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
280281영업경동제31호대여업일반부산광역시 금정구 중앙대로 2027, 501호(남산동)
281282영업경동제32호대여업일반부산광역시 금정구 중앙대로 2027, 501호(남산동)
282283영업대양 제 34호대여업개별부산광역시 금정구 남산동 117번지 4호 501호