Overview

Dataset statistics

Number of variables: 7
Number of observations: 57
Missing cells: 4
Missing cells (%): 1.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.3 KiB
Average record size in memory: 59.3 B
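
The missing-cell percentage follows from the counts in this table: 4 empty cells out of 57 rows times 7 columns. A minimal sketch of that arithmetic (the helper name `missing_cell_percentage` is made up for this illustration):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 57
missing_cells = 4


def missing_cell_percentage(missing: int, rows: int, cols: int) -> float:
    """Share of empty cells among all rows * cols cells, in percent."""
    return 100.0 * missing / (rows * cols)


pct = missing_cell_percentage(missing_cells, n_observations, n_variables)
print(f"{pct:.1f}%")
```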

Variable types

Numeric: 1
Text: 3
Categorical: 3

Dataset

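Description: Status information on construction machinery businesses in Busanjin-gu, Busan Metropolitan City. It covers heavy-equipment rental businesses for construction machinery and provides the business name, business type, registration class, phone number, address, and related fields.
Author: Busanjin-gu, Busan Metropolitan City (부산광역시 부산진구)
URL: https://www.data.go.kr/data/15025676/fileData.do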

Alerts

기준일자 (reference date) has constant value "2023-10-17" [Constant]
등록종별 (registration class) is highly overall correlated with 사업유형 (business type) [High correlation]
사업유형 is highly overall correlated with 등록종별 [High correlation]
사업유형 is highly imbalanced (78.1%) [Imbalance]
전화번호 (phone number) has 4 (7.0%) missing values [Missing]
연번 (serial number) has unique values [Unique]
상호(명칭) (business name) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 10:47:30.101278
Analysis finished: 2023-12-12 10:47:31.553412
Duration: 1.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 57
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29
Minimum: 1
Maximum: 57
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 645.0 B

Quantile statistics

Minimum: 1
5th percentile: 3.8
Q1: 15
Median: 29
Q3: 43
95th percentile: 54.2
Maximum: 57
Range: 56
Interquartile range (IQR): 28
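
These quantiles are consistent with linear interpolation over the sorted values. The sketch below assumes that 연번 is exactly the sequence 1 to 57 (suggested by the minimum, maximum, distinct count, and sum, but not shown in full in this report) and uses only the Python standard library:

```python
from statistics import median, quantiles

# Assumption: the 연번 column is exactly 1, 2, ..., 57.
values = list(range(1, 58))

# method="inclusive" interpolates linearly between the smallest and largest
# value, treating the data as the whole population.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")
twentieths = quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print("5th percentile:", p5)
print("Q1:", q1, "median:", median(values), "Q3:", q3)
print("95th percentile:", p95)
print("Range:", max(values) - min(values), "IQR:", q3 - q1)
```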

Descriptive statistics

Standard deviation: 16.598193
Coefficient of variation (CV): 0.57235147
Kurtosis: -1.2
Mean: 29
Median Absolute Deviation (MAD): 14
Skewness: 0
Sum: 1653
Variance: 275.5
Monotonicity: Strictly increasing
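
Most of these figures can be checked by hand. The sketch below again assumes that 연번 is the sequence 1 to 57 and uses sample (n - 1) variance, which matches the reported 275.5; it is a cross-check, not the profiler's own code:

```python
from statistics import mean, median, stdev, variance

# Assumption: the 연번 column is exactly 1, 2, ..., 57.
values = list(range(1, 58))

avg = mean(values)
var = variance(values)   # sample variance (divides by n - 1)
std = stdev(values)      # sample standard deviation
cv = std / avg           # coefficient of variation
center = median(values)
mad = median(abs(v - center) for v in values)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print("sum:", sum(values), "mean:", avg, "variance:", var)
print("std:", round(std, 6), "cv:", round(cv, 8), "mad:", mad)
print("strictly increasing:", strictly_increasing)
```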
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 1.8%
44 | 1 | 1.8%
32 | 1 | 1.8%
33 | 1 | 1.8%
34 | 1 | 1.8%
35 | 1 | 1.8%
36 | 1 | 1.8%
37 | 1 | 1.8%
38 | 1 | 1.8%
39 | 1 | 1.8%
Other values (47) | 47 | 82.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.8%
2 | 1 | 1.8%
3 | 1 | 1.8%
4 | 1 | 1.8%
5 | 1 | 1.8%
6 | 1 | 1.8%
7 | 1 | 1.8%
8 | 1 | 1.8%
9 | 1 | 1.8%
10 | 1 | 1.8%

Maximum 10 values
Value | Count | Frequency (%)
57 | 1 | 1.8%
56 | 1 | 1.8%
55 | 1 | 1.8%
54 | 1 | 1.8%
53 | 1 | 1.8%
52 | 1 | 1.8%
51 | 1 | 1.8%
50 | 1 | 1.8%
49 | 1 | 1.8%
48 | 1 | 1.8%

상호(명칭) (business name)
Text

UNIQUE

Distinct: 57
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B

Length

Max length: 15
Median length: 10
Mean length: 6.6315789
Min length: 3

Characters and Unicode

Total characters: 378
Distinct characters: 89
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
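
As a rough illustration of how such properties can be read per character, Python's standard `unicodedata` module exposes the general category of each code point. The helper name `category_counts` is hypothetical. The two-letter codes it returns map to the category names used in this report: `Lo` is Other Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, `So` Other Symbol, and `Zs` Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


# Two business names taken from the sample rows of this report.
for name in ["두경건기컨설팅(주)", "㈜더난건설"]:
    print(name, dict(category_counts(name)))
```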

Unique

Unique: 57
Unique (%): 100.0%

Sample

1st row: 정원건설기계매매상사
2nd row: 천일건설기계매매상사
3rd row: (주)대덕중기
4th row: 한신중기사
5th row: 두경건기컨설팅(주)

Value | Count | Frequency (%)
정원건설기계매매상사 | 1 | 1.7%
유진중기(주 | 1 | 1.7%
태광개발 | 1 | 1.7%
으뜸종합중기 | 1 | 1.7%
동일건기 | 1 | 1.7%
중앙종합중기(주 | 1 | 1.7%
유광중기 | 1 | 1.7%
성림중기 | 1 | 1.7%
동일종합중기 | 1 | 1.7%
일동건기(주 | 1 | 1.7%
Other values (48) | 48 | 82.8%

Most occurring characters

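Value | Count | Frequency (%)
[Hangul character lost] | 48 | 12.7%
[Hangul character lost] | 36 | 9.5%
[Hangul character lost] | 30 | 7.9%
( | 28 | 7.4%
) | 28 | 7.4%
[Hangul character lost] | 15 | 4.0%
[Hangul character lost] | 13 | 3.4%
[Hangul character lost] | 13 | 3.4%
[Hangul character lost] | 11 | 2.9%
[Hangul character lost] | 8 | 2.1%
Other values (79) | 148 | 39.2%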

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 319 | 84.4%
Open Punctuation | 28 | 7.4%
Close Punctuation | 28 | 7.4%
Other Symbol | 2 | 0.5%
Space Separator | 1 | 0.3%

Most frequent character per category

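Other Letter
Value | Count | Frequency (%)
[Hangul character lost] | 48 | 15.0%
[Hangul character lost] | 36 | 11.3%
[Hangul character lost] | 30 | 9.4%
[Hangul character lost] | 15 | 4.7%
[Hangul character lost] | 13 | 4.1%
[Hangul character lost] | 13 | 4.1%
[Hangul character lost] | 11 | 3.4%
[Hangul character lost] | 8 | 2.5%
[Hangul character lost] | 7 | 2.2%
[Hangul character lost] | 7 | 2.2%
Other values (75) | 131 | 41.1%

Open Punctuation
Value | Count | Frequency (%)
( | 28 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 28 | 100.0%

Other Symbol
Value | Count | Frequency (%)
㈜ | 2 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%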

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 321 | 84.9%
Common | 57 | 15.1%

Most frequent character per script

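Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 48 | 15.0%
[Hangul character lost] | 36 | 11.2%
[Hangul character lost] | 30 | 9.3%
[Hangul character lost] | 15 | 4.7%
[Hangul character lost] | 13 | 4.0%
[Hangul character lost] | 13 | 4.0%
[Hangul character lost] | 11 | 3.4%
[Hangul character lost] | 8 | 2.5%
[Hangul character lost] | 7 | 2.2%
[Hangul character lost] | 7 | 2.2%
Other values (76) | 133 | 41.4%

Common
Value | Count | Frequency (%)
( | 28 | 49.1%
) | 28 | 49.1%
(space) | 1 | 1.8%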

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 319 | 84.4%
ASCII | 57 | 15.1%
None | 2 | 0.5%

Most frequent character per block

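Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 48 | 15.0%
[Hangul character lost] | 36 | 11.3%
[Hangul character lost] | 30 | 9.4%
[Hangul character lost] | 15 | 4.7%
[Hangul character lost] | 13 | 4.1%
[Hangul character lost] | 13 | 4.1%
[Hangul character lost] | 11 | 3.4%
[Hangul character lost] | 8 | 2.5%
[Hangul character lost] | 7 | 2.2%
[Hangul character lost] | 7 | 2.2%
Other values (75) | 131 | 41.1%

ASCII
Value | Count | Frequency (%)
( | 28 | 49.1%
) | 28 | 49.1%
(space) | 1 | 1.8%

None
Value | Count | Frequency (%)
㈜ | 2 | 100.0%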

사업유형 (business type)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 2
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B
대여업 (rental): 55
매매업 (sales): 2

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 매매업
2nd row: 매매업
3rd row: 대여업
4th row: 대여업
5th row: 대여업

Common Values

Value | Count | Frequency (%)
대여업 | 55 | 96.5%
매매업 | 2 | 3.5%
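
The 78.1% reported in the Imbalance alert is consistent with one minus the Shannon entropy of the class shares, normalised by the maximum entropy for two classes. That the report computes it exactly this way is an assumption, and `imbalance_score` is a hypothetical helper written for this sketch:

```python
import math


def imbalance_score(counts):
    """One minus the Shannon entropy of the class shares, divided by log2(k).

    Returns 0.0 for perfectly balanced classes and approaches 1.0 as a
    single class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts from the Common Values table: 대여업 = 55, 매매업 = 2.
score = imbalance_score([55, 2])
print(f"{score:.1%}")
```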

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
대여업 | 55 | 96.5%
매매업 | 2 | 3.5%

등록종별 (registration class)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 5.3%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B
일반 (general): 43
개별 (individual): 12
<NA>: 2

Length

Max length: 4
Median length: 2
Mean length: 2.0701754
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 일반
4th row: 일반
5th row: 일반

Common Values

Value | Count | Frequency (%)
일반 | 43 | 75.4%
개별 | 12 | 21.1%
<NA> | 2 | 3.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
일반 | 43 | 75.4%
개별 | 12 | 21.1%
<NA> | 2 | 3.5%

전화번호 (phone number)
Text

MISSING

Distinct: 50
Distinct (%): 94.3%
Missing: 4
Missing (%): 7.0%
Memory size: 588.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 636
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 47
Unique (%): 88.7%
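
Both percentages for this column appear to use the 53 non-missing entries as the denominator rather than all 57 rows: 50 / 53 is about 94.3% and 47 / 53 is about 88.7%. The sketch below reproduces that with synthetic placeholder strings, since the full column is not listed in this report; `distinct_and_unique` is a hypothetical helper.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct %, unique %) over the non-missing entries.

    "Distinct" counts different values; "unique" counts values seen exactly once.
    """
    present = [v for v in values if v is not None]
    counts = Counter(present)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return 100 * distinct / len(present), 100 * unique / len(present)


# Synthetic stand-in with the same shape as 전화번호: 57 rows, 4 missing,
# 3 values that appear twice and 47 that appear once (placeholder strings).
phones = [f"dup-{i}" for i in range(3)] * 2 + [f"once-{i}" for i in range(47)] + [None] * 4
distinct_pct, unique_pct = distinct_and_unique(phones)
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```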

Sample

1st row: 051-805-7494
2nd row: 051-806-2538
3rd row: 051-803-2522
4th row: 051-802-4777
5th row: 051-807-2007

Value | Count | Frequency (%)
051-817-4411 | 2 | 3.8%
051-806-2538 | 2 | 3.8%
051-804-8804 | 2 | 3.8%
051-802-0405 | 1 | 1.9%
051-817-0330 | 1 | 1.9%
051-505-0917 | 1 | 1.9%
051-805-9999 | 1 | 1.9%
051-805-7494 | 1 | 1.9%
051-811-1424 | 1 | 1.9%
051-819-4442 | 1 | 1.9%
Other values (40) | 40 | 75.5%

Most occurring characters

Value | Count | Frequency (%)
0 | 115 | 18.1%
1 | 107 | 16.8%
- | 106 | 16.7%
8 | 81 | 12.7%
5 | 75 | 11.8%
4 | 36 | 5.7%
7 | 29 | 4.6%
3 | 27 | 4.2%
6 | 20 | 3.1%
2 | 20 | 3.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 530 | 83.3%
Dash Punctuation | 106 | 16.7%
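
The category totals fit a single fixed layout for every non-missing number: 53 entries of 12 characters, each with 10 digits and 2 dashes. The sketch below checks the sample rows against an assumed `NNN-NNN-NNNN` pattern (inferred from those rows, not stated by the report) and recomputes the character budget; `PHONE_PATTERN` and `is_well_formed` are hypothetical names.

```python
import re

# Assumed format, inferred from the sample rows: NNN-NNN-NNNN.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")


def is_well_formed(phone: str) -> bool:
    """True when the whole string matches the assumed 12-character pattern."""
    return PHONE_PATTERN.fullmatch(phone) is not None


sample = ["051-805-7494", "051-806-2538", "051-803-2522", "051-802-4777", "051-807-2007"]
print(all(is_well_formed(p) for p in sample))

# Character budget implied by the pattern for the 53 non-missing numbers.
non_missing = 57 - 4
digits, dashes, total = non_missing * 10, non_missing * 2, non_missing * 12
print(digits, "digits,", dashes, "dashes,", total, "characters")
```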

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 115 | 21.7%
1 | 107 | 20.2%
8 | 81 | 15.3%
5 | 75 | 14.2%
4 | 36 | 6.8%
7 | 29 | 5.5%
3 | 27 | 5.1%
6 | 20 | 3.8%
2 | 20 | 3.8%
9 | 20 | 3.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 106 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 636 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 115 | 18.1%
1 | 107 | 16.8%
- | 106 | 16.7%
8 | 81 | 12.7%
5 | 75 | 11.8%
4 | 36 | 5.7%
7 | 29 | 4.6%
3 | 27 | 4.2%
6 | 20 | 3.1%
2 | 20 | 3.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 636 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 115 | 18.1%
1 | 107 | 16.8%
- | 106 | 16.7%
8 | 81 | 12.7%
5 | 75 | 11.8%
4 | 36 | 5.7%
7 | 29 | 4.6%
3 | 27 | 4.2%
6 | 20 | 3.1%
2 | 20 | 3.1%

주소 (address)
Text

Distinct: 46
Distinct (%): 80.7%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B

Length

Max length: 48
Median length: 41
Mean length: 34.982456
Min length: 22

Characters and Unicode

Total characters: 1994
Distinct characters: 117
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 38
Unique (%): 66.7%

Sample

1st row: 부산광역시 부산진구 서전로47번길 19(전포동)
2nd row: 부산광역시 부산진구 중앙대로 862, 오피스텔동 224호(전포동, 전포LH아파트)
3rd row: 부산광역시 부산진구 서전로37번길 25-9(전포동)
4th row: 부산광역시 부산진구 서전로47번길 17, 철물상가A동 라301호
5th row: 부산광역시 부산진구 서전로37번길 25-9(전포동)

Value | Count | Frequency (%)
부산광역시 | 57 | 17.6%
부산진구 | 57 | 17.6%
서전로37번길 | 14 | 4.3%
25-9 | 11 | 3.4%
예일프라자 | 11 | 3.4%
동천로 | 9 | 2.8%
116 | 5 | 1.5%
서전로47번길 | 5 | 1.5%
한신밴 | 4 | 1.2%
305호 | 4 | 1.2%
Other values (112) | 146 | 45.2%

Most occurring characters

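Value | Count | Frequency (%)
(space) | 266 | 13.3%
[Hangul character lost] | 116 | 5.8%
[Hangul character lost] | 114 | 5.7%
, | 71 | 3.6%
[Hangul character lost] | 70 | 3.5%
1 | 65 | 3.3%
[Hangul character lost] | 58 | 2.9%
[Hangul character lost] | 58 | 2.9%
[Hangul character lost] | 58 | 2.9%
[Hangul character lost] | 57 | 2.9%
Other values (107) | 1061 | 53.2%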

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1169 | 58.6%
Decimal Number | 378 | 19.0%
Space Separator | 266 | 13.3%
Other Punctuation | 72 | 3.6%
Close Punctuation | 41 | 2.1%
Open Punctuation | 41 | 2.1%
Dash Punctuation | 17 | 0.9%
Uppercase Letter | 10 | 0.5%

Most frequent character per category

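Other Letter
Value | Count | Frequency (%)
[Hangul character lost] | 116 | 9.9%
[Hangul character lost] | 114 | 9.8%
[Hangul character lost] | 70 | 6.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 56 | 4.8%
Other values (85) | 468 | 40.0%

Decimal Number
Value | Count | Frequency (%)
1 | 65 | 17.2%
3 | 51 | 13.5%
0 | 49 | 13.0%
2 | 47 | 12.4%
5 | 40 | 10.6%
7 | 35 | 9.3%
9 | 32 | 8.5%
4 | 26 | 6.9%
6 | 24 | 6.3%
8 | 9 | 2.4%

Uppercase Letter
Value | Count | Frequency (%)
B | 2 | 20.0%
A | 2 | 20.0%
H | 2 | 20.0%
L | 2 | 20.0%
T | 1 | 10.0%
O | 1 | 10.0%

Other Punctuation
Value | Count | Frequency (%)
, | 71 | 98.6%
/ | 1 | 1.4%

Space Separator
Value | Count | Frequency (%)
(space) | 266 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 41 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 41 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 17 | 100.0%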

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1169 | 58.6%
Common | 815 | 40.9%
Latin | 10 | 0.5%

Most frequent character per script

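Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 116 | 9.9%
[Hangul character lost] | 114 | 9.8%
[Hangul character lost] | 70 | 6.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 56 | 4.8%
Other values (85) | 468 | 40.0%

Common
Value | Count | Frequency (%)
(space) | 266 | 32.6%
, | 71 | 8.7%
1 | 65 | 8.0%
3 | 51 | 6.3%
0 | 49 | 6.0%
2 | 47 | 5.8%
) | 41 | 5.0%
( | 41 | 5.0%
5 | 40 | 4.9%
7 | 35 | 4.3%
Other values (6) | 109 | 13.4%

Latin
Value | Count | Frequency (%)
B | 2 | 20.0%
A | 2 | 20.0%
H | 2 | 20.0%
L | 2 | 20.0%
T | 1 | 10.0%
O | 1 | 10.0%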

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1169 | 58.6%
ASCII | 825 | 41.4%

Most frequent character per block

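ASCII
Value | Count | Frequency (%)
(space) | 266 | 32.2%
, | 71 | 8.6%
1 | 65 | 7.9%
3 | 51 | 6.2%
0 | 49 | 5.9%
2 | 47 | 5.7%
) | 41 | 5.0%
( | 41 | 5.0%
5 | 40 | 4.8%
7 | 35 | 4.2%
Other values (12) | 119 | 14.4%

Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 116 | 9.9%
[Hangul character lost] | 114 | 9.8%
[Hangul character lost] | 70 | 6.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 58 | 5.0%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 57 | 4.9%
[Hangul character lost] | 56 | 4.8%
Other values (85) | 468 | 40.0%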

기준일자 (reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B
2023-10-17: 57

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-10-17
2nd row: 2023-10-17
3rd row: 2023-10-17
4th row: 2023-10-17
5th row: 2023-10-17

Common Values

Value | Count | Frequency (%)
2023-10-17 | 57 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-10-17 | 57 | 100.0%

Interactions


Correlations

Variable | 연번 | 상호(명칭) | 사업유형 | 등록종별 | 전화번호 | 주소
연번 | 1.000 | 1.000 | 0.549 | 0.287 | 0.852 | 0.692
상호(명칭) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
사업유형 | 0.549 | 1.000 | 1.000 | NaN | 0.000 | 0.000
등록종별 | 0.287 | 1.000 | NaN | 1.000 | 1.000 | 0.933
전화번호 | 0.852 | 1.000 | 0.000 | 1.000 | 1.000 | 0.999
주소 | 0.692 | 1.000 | 0.000 | 0.933 | 0.999 | 1.000

Variable | 등록종별 | 사업유형
등록종별 | 1.000 | 1.000
사업유형 | 1.000 | 1.000

Variable | 연번 | 사업유형 | 등록종별
연번 | 1.000 | 0.389 | 0.195
사업유형 | 0.389 | 1.000 | 1.000
등록종별 | 0.195 | 1.000 | 1.000
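
This text export does not name the measure behind each matrix. One common association measure for two categorical columns is Cramér's V, and the uncorrected form gives 1.0 for 사업유형 against 등록종별, in line with the 1.000 shown for that pair. Treat the choice of measure as an assumption. The cell counts below are inferred from the category totals and the sample rows (both 매매업 rows carry `<NA>`), and `cramers_v` is a hypothetical helper.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Rows: 사업유형 (매매업, 대여업); columns: 등록종별 (일반, 개별, <NA>).
table = [
    [0, 0, 2],    # 매매업
    [43, 12, 0],  # 대여업
]
print(round(cramers_v(table), 3))
```

A value of 1.0 here only says that each 등록종별 category maps to a single 사업유형; with just two 매매업 rows, that is a thin basis for any broader conclusion.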

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.

Sample

First rows
Index | 연번 | 상호(명칭) | 사업유형 | 등록종별 | 전화번호 | 주소 | 기준일자
0 | 1 | 정원건설기계매매상사 | 매매업 | <NA> | 051-805-7494 | 부산광역시 부산진구 서전로47번길 19(전포동) | 2023-10-17
1 | 2 | 천일건설기계매매상사 | 매매업 | <NA> | 051-806-2538 | 부산광역시 부산진구 중앙대로 862, 오피스텔동 224호(전포동, 전포LH아파트) | 2023-10-17
2 | 3 | (주)대덕중기 | 대여업 | 일반 | 051-803-2522 | 부산광역시 부산진구 서전로37번길 25-9(전포동) | 2023-10-17
3 | 4 | 한신중기사 | 대여업 | 일반 | 051-802-4777 | 부산광역시 부산진구 서전로47번길 17, 철물상가A동 라301호 | 2023-10-17
4 | 5 | 두경건기컨설팅(주) | 대여업 | 일반 | 051-807-2007 | 부산광역시 부산진구 서전로37번길 25-9(전포동) | 2023-10-17
5 | 6 | 아주건기(주) | 대여업 | 일반 | 051-817-4411 | 부산광역시 부산진구 서전로37번길 25-9, 305호(전포동, 예일프라자) | 2023-10-17
6 | 7 | 남국중기 | 대여업 | 일반 | 051-816-1936 | 부산광역시 부산진구 서전로37번길 25-9, 3층 309호(전포동, 예일프라자) | 2023-10-17
7 | 8 | 대동종합중기(주) | 대여업 | 일반 | 051-807-6280 | 부산광역시 부산진구 서전로37번길 25-9, 예일프라자 305 | 2023-10-17
8 | 9 | 대림중기(주) | 대여업 | 일반 | 051-806-0097 | 부산광역시 부산진구 서전로37번길 25-9, 예일프라자 305호 | 2023-10-17
9 | 10 | 일동기업(주) | 대여업 | 일반 | 051-816-6033 | 부산광역시 부산진구 동천로108번길 11, 창원빌딩 701호 | 2023-10-17

Last rows
Index | 연번 | 상호(명칭) | 사업유형 | 등록종별 | 전화번호 | 주소 | 기준일자
47 | 48 | 대야기업(주) | 대여업 | 일반 | 051-804-1488 | 부산광역시 부산진구 서전로37번길 25-9, 예일프라자 305호 | 2023-10-17
48 | 49 | 육일건기(주) | 대여업 | 일반 | 051-818-9970 | 부산광역시 부산진구 동천로 116, 1012호(전포동, 한신밴) | 2023-10-17
49 | 50 | 그린중기 | 대여업 | 일반 | 051-895-5001 | 부산광역시 부산진구 동평로94번길 34(당감동) | 2023-10-17
50 | 51 | 금호종합중기 | 대여업 | 일반 | 051-811-1300 | 부산광역시 부산진구 동천로 116(전포동) | 2023-10-17
51 | 52 | 한라건기 | 대여업 | 일반 | 051-804-4111 | 부산광역시 부산진구 동성로 133, 동연아미가 1406호 | 2023-10-17
52 | 53 | (주)성신종합중기 | 대여업 | 일반 | 051-802-0405 | 부산광역시 부산진구 전포대로255번길 41(전포동) | 2023-10-17
53 | 54 | (주)지산리사이클링 | 대여업 | 개별 | 051-852-8333 | 부산광역시 부산진구 연수로 31(양정동) | 2023-10-17
54 | 55 | 신호지게차㈜ | 대여업 | 개별 | 051-806-2538 | 부산광역시 부산진구 중앙대로 862, 오피스텔동 224호(전포동, 전포LH아파트) | 2023-10-17
55 | 56 | ㈜더난건설 | 대여업 | 개별 | 051-515-6726 | 부산광역시 부산진구 중앙대로993,2606호(양정동,시청역롯데골드로즈) | 2023-10-17
56 | 57 | 대한건설기계종합중기 주식회사 | 대여업 | 개별 | 051-861-4088 | 부산광역시 부산진구 동평로406번길 71,3층(양정동) | 2023-10-17