Overview

Dataset statistics

Number of variables6
Number of observations283
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.7 KiB
Average record size in memory49.5 B

Variable types

Numeric1
Categorical4
Text1

Dataset

Description부산광역시_금정구_건설기계사업자현황_20230214
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025829

Alerts

상태 has constant value ""Constant
사업유형 is highly overall correlated with 등록종별 and 1 other fieldsHigh correlation
등록종별 is highly overall correlated with 사업유형 and 1 other fieldsHigh correlation
주소 is highly overall correlated with 사업유형 and 1 other fieldsHigh correlation
사업유형 is highly imbalanced (89.9%)Imbalance
등록종별 is highly imbalanced (76.5%)Imbalance
주소 is highly imbalanced (78.9%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:16:28.175985
Analysis finished2023-12-10 17:16:29.331317
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct283
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean142
Minimum1
Maximum283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-11T02:16:29.527445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.1
Q171.5
median142
Q3212.5
95-th percentile268.9
Maximum283
Range282
Interquartile range (IQR)141

Descriptive statistics

Standard deviation81.839273
Coefficient of variation (CV)0.57633291
Kurtosis-1.2
Mean142
Median Absolute Deviation (MAD)71
Skewness0
Sum40186
Variance6697.6667
MonotonicityStrictly increasing
2023-12-11T02:16:29.915220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
188 1
 
0.4%
194 1
 
0.4%
193 1
 
0.4%
192 1
 
0.4%
191 1
 
0.4%
190 1
 
0.4%
189 1
 
0.4%
187 1
 
0.4%
196 1
 
0.4%
Other values (273) 273
96.5%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
283 1
0.4%
282 1
0.4%
281 1
0.4%
280 1
0.4%
279 1
0.4%
278 1
0.4%
277 1
0.4%
276 1
0.4%
275 1
0.4%
274 1
0.4%

상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
영업
283 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 283
100.0%

Length

2023-12-11T02:16:30.231395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:30.441763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 283
100.0%
Distinct259
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-11T02:16:30.997904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length6
Mean length6.4770318
Min length4

Characters and Unicode

Total characters1833
Distinct characters138
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique236 ?
Unique (%)83.4%

Sample

1st row동부폐차장
2nd row대호건기
3rd row금정합동건설기계매매상사
4th row(주)제일종합중기
5th row경동제10호(김상오)
ValueCountFrequency (%)
세기tower 3
 
1.0%
대양제33호 2
 
0.7%
쌍용제15호 2
 
0.7%
개별제60호 2
 
0.7%
대양제11호 2
 
0.7%
주)디알산업개발 2
 
0.7%
개별제2호 2
 
0.7%
개별제5호 2
 
0.7%
경동제11호 2
 
0.7%
개별제61호 2
 
0.7%
Other values (259) 274
92.9%
2023-12-11T02:16:31.955261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
263
 
14.3%
260
 
14.2%
1 97
 
5.3%
2 79
 
4.3%
67
 
3.7%
64
 
3.5%
3 60
 
3.3%
56
 
3.1%
53
 
2.9%
4 45
 
2.5%
Other values (128) 789
43.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1281
69.9%
Decimal Number 449
 
24.5%
Open Punctuation 37
 
2.0%
Close Punctuation 37
 
2.0%
Uppercase Letter 17
 
0.9%
Space Separator 12
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
263
20.5%
260
20.3%
67
 
5.2%
64
 
5.0%
56
 
4.4%
53
 
4.1%
40
 
3.1%
39
 
3.0%
36
 
2.8%
31
 
2.4%
Other values (108) 372
29.0%
Decimal Number
ValueCountFrequency (%)
1 97
21.6%
2 79
17.6%
3 60
13.4%
4 45
10.0%
5 44
9.8%
7 33
 
7.3%
6 29
 
6.5%
8 25
 
5.6%
9 21
 
4.7%
0 16
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
T 3
17.6%
O 3
17.6%
W 3
17.6%
R 3
17.6%
E 3
17.6%
H 1
 
5.9%
C 1
 
5.9%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1281
69.9%
Common 535
29.2%
Latin 17
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
263
20.5%
260
20.3%
67
 
5.2%
64
 
5.0%
56
 
4.4%
53
 
4.1%
40
 
3.1%
39
 
3.0%
36
 
2.8%
31
 
2.4%
Other values (108) 372
29.0%
Common
ValueCountFrequency (%)
1 97
18.1%
2 79
14.8%
3 60
11.2%
4 45
8.4%
5 44
8.2%
( 37
 
6.9%
) 37
 
6.9%
7 33
 
6.2%
6 29
 
5.4%
8 25
 
4.7%
Other values (3) 49
9.2%
Latin
ValueCountFrequency (%)
T 3
17.6%
O 3
17.6%
W 3
17.6%
R 3
17.6%
E 3
17.6%
H 1
 
5.9%
C 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1281
69.9%
ASCII 552
30.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
263
20.5%
260
20.3%
67
 
5.2%
64
 
5.0%
56
 
4.4%
53
 
4.1%
40
 
3.1%
39
 
3.0%
36
 
2.8%
31
 
2.4%
Other values (108) 372
29.0%
ASCII
ValueCountFrequency (%)
1 97
17.6%
2 79
14.3%
3 60
10.9%
4 45
8.2%
5 44
8.0%
( 37
 
6.7%
) 37
 
6.7%
7 33
 
6.0%
6 29
 
5.3%
8 25
 
4.5%
Other values (10) 66
12.0%

사업유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
대여업
276 
매매업
 
4
정비업
 
2
해체재활용업
 
1

Length

Max length6
Median length3
Mean length3.0106007
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row해체재활용업
2nd row매매업
3rd row매매업
4th row대여업
5th row대여업

Common Values

ValueCountFrequency (%)
대여업 276
97.5%
매매업 4
 
1.4%
정비업 2
 
0.7%
해체재활용업 1
 
0.4%

Length

2023-12-11T02:16:32.244481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:32.471683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대여업 276
97.5%
매매업 4
 
1.4%
정비업 2
 
0.7%
해체재활용업 1
 
0.4%

등록종별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
개별
262 
일반
 
14
<NA>
 
5
전문(타워크레인)
 
2

Length

Max length9
Median length2
Mean length2.0848057
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row일반
5th row개별

Common Values

ValueCountFrequency (%)
개별 262
92.6%
일반 14
 
4.9%
<NA> 5
 
1.8%
전문(타워크레인) 2
 
0.7%

Length

2023-12-11T02:16:32.732383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:32.991954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개별 262
92.6%
일반 14
 
4.9%
na 5
 
1.8%
전문(타워크레인 2
 
0.7%

주소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct25
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
부산광역시 금정구 중앙대로 2027, 501호(남산동)
252 
부산광역시 금정구 중앙대로 2027(남산동)
 
4
부산광역시 금정구 명서로 76, 101호(서동, 삼한아파트상가)
 
3
부산광역시 금정구 남산동 117번지 4호 501호
 
2
부산광역시 금정구 중앙대로1805번길 6, 1205호(구서동, 삼백크라시앙)
 
2
Other values (20)
 
20

Length

Max length44
Median length30
Mean length30.056537
Min length21

Unique

Unique20 ?
Unique (%)7.1%

Sample

1st row부산광역시 금정구 개좌로 227(회동동)
2nd row부산광역시 금정구 중앙대로 1799, 구서동유림노르웨이아침 333호
3rd row부산광역시 금정구 중앙대로 2086, 2층(남산동)
4th row부산광역시 금정구 금정로 249(구서동)
5th row부산광역시 금정구 중앙대로 2027, 501호(남산동)

Common Values

ValueCountFrequency (%)
부산광역시 금정구 중앙대로 2027, 501호(남산동) 252
89.0%
부산광역시 금정구 중앙대로 2027(남산동) 4
 
1.4%
부산광역시 금정구 명서로 76, 101호(서동, 삼한아파트상가) 3
 
1.1%
부산광역시 금정구 남산동 117번지 4호 501호 2
 
0.7%
부산광역시 금정구 중앙대로1805번길 6, 1205호(구서동, 삼백크라시앙) 2
 
0.7%
부산광역시 금정구 중앙대로 1799, 구서동유림노르웨이아침 333호 1
 
0.4%
부산광역시 금정구 중앙대로 2086, 2층(남산동) 1
 
0.4%
부산광역시 금정구 금정로 249(구서동) 1
 
0.4%
부산광역시 금정구 중앙대로 2389(두구동) 1
 
0.4%
부산광역시 금정구 중앙대로 1799, 유림오피스텔 306호 1
 
0.4%
Other values (15) 15
 
5.3%

Length

2023-12-11T02:16:33.254397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역시 283
19.9%
금정구 283
19.9%
중앙대로 261
18.3%
2027 252
17.7%
501호(남산동 252
17.7%
2027(남산동 4
 
0.3%
명서로 4
 
0.3%
76 3
 
0.2%
101호(서동 3
 
0.2%
삼한아파트상가 3
 
0.2%
Other values (62) 77
 
5.4%

Interactions

2023-12-11T02:16:28.671644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:16:33.456877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업유형등록종별주소
순번1.0000.1870.1830.120
사업유형0.1871.0001.0000.974
등록종별0.1831.0001.0000.975
주소0.1200.9740.9751.000
2023-12-11T02:16:33.674068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주소사업유형등록종별
주소1.0000.8690.801
사업유형0.8691.0000.998
등록종별0.8010.9981.000
2023-12-11T02:16:33.887933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업유형등록종별주소
순번1.0000.1110.1090.037
사업유형0.1111.0000.9980.869
등록종별0.1090.9981.0000.801
주소0.0370.8690.8011.000

Missing values

2023-12-11T02:16:28.948449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:16:29.228932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상태상호명칭사업유형등록종별주소
01영업동부폐차장해체재활용업<NA>부산광역시 금정구 개좌로 227(회동동)
12영업대호건기매매업<NA>부산광역시 금정구 중앙대로 1799, 구서동유림노르웨이아침 333호
23영업금정합동건설기계매매상사매매업<NA>부산광역시 금정구 중앙대로 2086, 2층(남산동)
34영업(주)제일종합중기대여업일반부산광역시 금정구 금정로 249(구서동)
45영업경동제10호(김상오)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
56영업경동제12호(하천수)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
67영업경동제14호(차무준)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
78영업경동제15호(김원태)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
89영업경동제21호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
910영업경동제22호(김진도)대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
순번상태상호명칭사업유형등록종별주소
273274영업쌍용제21호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
274275영업개별제5호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
275276영업(주)디알산업개발대여업개별부산광역시 금정구 중앙대로1805번길 6, 1205호(구서동, 삼백크라시앙)
276277영업개별제2호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
277278영업개별제9호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
278279영업대양제11호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
279280영업개별제31호대여업개별부산광역시 금정구 중앙대로 2027, 501호(남산동)
280281영업경동제31호대여업일반부산광역시 금정구 중앙대로 2027, 501호(남산동)
281282영업경동제32호대여업일반부산광역시 금정구 중앙대로 2027, 501호(남산동)
282283영업대양 제 34호대여업개별부산광역시 금정구 남산동 117번지 4호 501호