Overview

Dataset statistics

Number of variables: 4
Number of observations: 561
Missing cells: 2
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 18.2 KiB
Average record size in memory: 33.2 B
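
The derived figures above can be cross-checked from the raw counts. The following is a minimal sketch, assuming that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
n_variables = 4
n_observations = 561
missing_cells = 2
total_size_kib = 18.2

# Assumption: percentages are relative to the total number of cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```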

Variable types

Numeric: 1
Text: 2
Categorical: 1

Dataset

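Description: Status of specialty construction businesses in Gangseo-gu, Busan Metropolitan City. The data contains the following columns: 연번 (serial number), 상호 (business name), 업종 (business type), 영업소재지(도로명주소) (business location, road-name address).
Author: 공공데이터포털 (Public Data Portal)
URL: https://www.data.go.kr/data/3045937/fileData.do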

Alerts

연번 has unique values (Unique)

Reproduction

Analysis started: 2024-02-10 13:05:46.377914
Analysis finished: 2024-02-10 13:05:51.821850
Duration: 5.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
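
The reported duration follows directly from the two timestamps. A small sketch using only the standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-02-10 13:05:46.377914")
finished = datetime.fromisoformat("2024-02-10 13:05:51.821850")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```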

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 561
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 281
Minimum: 1
Maximum: 561
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 29
Q1: 141
Median: 281
Q3: 421
95-th percentile: 533
Maximum: 561
Range: 560
Interquartile range (IQR): 280
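
These quantiles are consistent with linear interpolation over the sorted values. The sketch below assumes that 연번 holds the integers 1 through 561 (which matches the reported minimum, maximum, distinct count, and sum); the `percentile` helper is a hypothetical illustration, not the routine the report itself uses.

```python
def percentile(sorted_values, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation."""
    position = (len(sorted_values) - 1) * q / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Assumption: the column contains exactly the integers 1..561.
serial_numbers = list(range(1, 562))

quantiles = {q: percentile(serial_numbers, q) for q in (5, 25, 50, 75, 95)}
iqr = quantiles[75] - quantiles[25]
print(quantiles, iqr)
```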

Descriptive statistics

Standard deviation: 162.09102
Coefficient of variation (CV): 0.57683638
Kurtosis: -1.2
Mean: 281
Median Absolute Deviation (MAD): 140
Skewness: 0
Sum: 157641
Variance: 26273.5
Monotonicity: Strictly increasing
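
Most of these descriptive statistics can be reproduced with the standard library. The sketch assumes that 연번 holds the integers 1 through 561 and that the variance is the sample variance (n − 1 denominator), which is the reading that matches the reported value.

```python
import statistics

# Assumption: the column contains exactly the integers 1..561.
serial_numbers = list(range(1, 562))

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)  # sample variance (n - 1)
std_dev = statistics.stdev(serial_numbers)      # sample standard deviation
cv = std_dev / mean                             # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(serial_numbers)
mad = statistics.median(abs(x - median) for x in serial_numbers)

print(total, mean, variance, std_dev, cv, mad)
```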
Histogram with fixed size bins (bins=50)
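
As an illustration of fixed-size binning, the sketch below splits the value range into 50 equal-width bins and counts the values in each. It assumes 연번 holds the integers 1 through 561 and that the maximum is included in the last bin; the report's own plotting code may handle bin edges differently.

```python
# Assumption: the column contains exactly the integers 1..561.
serial_numbers = range(1, 562)
n_bins = 50

lo, hi = min(serial_numbers), max(serial_numbers)
width = (hi - lo) / n_bins  # every bin spans the same width

counts = [0] * n_bins
for value in serial_numbers:
    # The maximum sits on the final edge, so clamp it into the last bin.
    index = min(int((value - lo) / width), n_bins - 1)
    counts[index] += 1

print(width, counts)
```

Because the values are evenly spaced, every bin ends up with either 11 or 12 observations, which is why the histogram for this column is essentially flat.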
Value | Count | Frequency (%)
1 | 1 | 0.2%
378 | 1 | 0.2%
372 | 1 | 0.2%
373 | 1 | 0.2%
374 | 1 | 0.2%
375 | 1 | 0.2%
376 | 1 | 0.2%
377 | 1 | 0.2%
379 | 1 | 0.2%
370 | 1 | 0.2%
Other values (551) | 551 | 98.2%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
561 | 1 | 0.2%
560 | 1 | 0.2%
559 | 1 | 0.2%
558 | 1 | 0.2%
557 | 1 | 0.2%
556 | 1 | 0.2%
555 | 1 | 0.2%
554 | 1 | 0.2%
553 | 1 | 0.2%
552 | 1 | 0.2%

상호
Text

Distinct: 419
Distinct (%): 74.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB

Length

Max length: 17
Median length: 16
Mean length: 7.6737968
Min length: 3

Characters and Unicode

Total characters: 4305
Distinct characters: 295
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
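
To illustrate how such character properties can be read, the sketch below counts the Unicode general category of each character in one business name taken from the sample rows of this report. It uses Python's standard `unicodedata` module, which returns two-letter codes: `Lo` is Other Letter, `Lu` is Uppercase Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation, and `Po` is Other Punctuation. The `category_counts` helper is hypothetical and is not part of the profiling tool.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the Unicode general category of each character in text."""
    return Counter(unicodedata.category(ch) for ch in text)

# A 상호 value taken from the sample rows of this report.
name = "흥국이엔씨(E&C)"
print(category_counts(name))
```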

Unique

Unique: 317
Unique (%): 56.5%
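
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of the difference, using the first five 상호 values shown in this report's sample rows:

```python
from collections import Counter

# First five 상호 values, copied from the sample rows of this report.
names = ["(유)금문건설", "(유)한음이엔지", "(유)한음이엔지", "(주)가람테크", "(주)가안이엔씨"]

counts = Counter(names)
distinct = len(counts)                               # different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(distinct, unique)
```

In this five-row excerpt, (유)한음이엔지 appears twice, so it counts toward distinct but not toward unique.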

Sample

1st row: (유)금문건설
2nd row: (유)한음이엔지
3rd row: (유)한음이엔지
4th row: (주)가람테크
5th row: (주)가안이엔씨
Value | Count | Frequency (%)
주)거도산업 | 6 | 1.1%
초석에이치디주식회사 | 6 | 1.1%
엘코미(주 | 5 | 0.9%
씨·티(c·t | 4 | 0.7%
주식회사신영토건 | 4 | 0.7%
주)그루빅건설 | 4 | 0.7%
주)삼공사 | 4 | 0.7%
동아정밀공업사 | 4 | 0.7%
금정산업건설 | 4 | 0.7%
주)누리세움 | 4 | 0.7%
Other values (409) | 516 | 92.0%

Most occurring characters

ValueCountFrequency (%)
483
 
11.2%
) 398
 
9.2%
( 398
 
9.2%
136
 
3.2%
116
 
2.7%
112
 
2.6%
99
 
2.3%
98
 
2.3%
96
 
2.2%
74
 
1.7%
Other values (285) 2295
53.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3459 | 80.3%
Close Punctuation | 398 | 9.2%
Open Punctuation | 398 | 9.2%
Uppercase Letter | 31 | 0.7%
Other Punctuation | 12 | 0.3%
Other Symbol | 7 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
483
 
14.0%
136
 
3.9%
116
 
3.4%
112
 
3.2%
99
 
2.9%
98
 
2.8%
96
 
2.8%
74
 
2.1%
71
 
2.1%
67
 
1.9%
Other values (265) 2107
60.9%
Uppercase Letter
ValueCountFrequency (%)
C 6
19.4%
T 5
16.1%
E 5
16.1%
A 3
9.7%
G 2
 
6.5%
K 2
 
6.5%
J 1
 
3.2%
D 1
 
3.2%
F 1
 
3.2%
O 1
 
3.2%
Other values (4) 4
12.9%
Other Punctuation
ValueCountFrequency (%)
· 8
66.7%
& 2
 
16.7%
. 2
 
16.7%
Close Punctuation
ValueCountFrequency (%)
) 398
100.0%
Open Punctuation
ValueCountFrequency (%)
( 398
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3466 | 80.5%
Common | 808 | 18.8%
Latin | 31 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
483
 
13.9%
136
 
3.9%
116
 
3.3%
112
 
3.2%
99
 
2.9%
98
 
2.8%
96
 
2.8%
74
 
2.1%
71
 
2.0%
67
 
1.9%
Other values (266) 2114
61.0%
Latin
ValueCountFrequency (%)
C 6
19.4%
T 5
16.1%
E 5
16.1%
A 3
9.7%
G 2
 
6.5%
K 2
 
6.5%
J 1
 
3.2%
D 1
 
3.2%
F 1
 
3.2%
O 1
 
3.2%
Other values (4) 4
12.9%
Common
ValueCountFrequency (%)
) 398
49.3%
( 398
49.3%
· 8
 
1.0%
& 2
 
0.2%
. 2
 
0.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3459 | 80.3%
ASCII | 831 | 19.3%
None | 15 | 0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
483
 
14.0%
136
 
3.9%
116
 
3.4%
112
 
3.2%
99
 
2.9%
98
 
2.8%
96
 
2.8%
74
 
2.1%
71
 
2.1%
67
 
1.9%
Other values (265) 2107
60.9%
ASCII
ValueCountFrequency (%)
) 398
47.9%
( 398
47.9%
C 6
 
0.7%
T 5
 
0.6%
E 5
 
0.6%
A 3
 
0.4%
G 2
 
0.2%
K 2
 
0.2%
& 2
 
0.2%
. 2
 
0.2%
Other values (8) 8
 
1.0%
None
ValueCountFrequency (%)
· 8
53.3%
7
46.7%

업종
Categorical

Distinct: 14
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB
기계가스설비공사업: 121
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업: 65
상ㆍ하수도설비공사업: 61
지반조성ㆍ포장공사업: 58
실내건축공사업: 43
Other values (9): 213

Length

Max length: 17
Median length: 13
Mean length: 10.336898
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 상ㆍ하수도설비공사업
2nd row: 시설물유지관리업
3rd row: 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업
4th row: 기계가스설비공사업
5th row: 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업

Common Values

Value | Count | Frequency (%)
기계가스설비공사업 | 121 | 21.6%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 65 | 11.6%
상ㆍ하수도설비공사업 | 61 | 10.9%
지반조성ㆍ포장공사업 | 58 | 10.3%
실내건축공사업 | 43 | 7.7%
철근ㆍ콘크리트공사업 | 40 | 7.1%
가스난방공사업 | 36 | 6.4%
도장ㆍ습식ㆍ방수ㆍ석공사업 | 35 | 6.2%
조경식재ㆍ시설물공사업 | 34 | 6.1%
구조물해체ㆍ비계공사업 | 28 | 5.0%
Other values (4) | 40 | 7.1%
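
The frequency column is each count divided by the total number of rows. A short sketch that recomputes the percentages from the counts listed above (the dictionary is copied from this table; "Other values (4)" is treated as a single bucket):

```python
# Counts copied from the Common Values table for 업종.
counts = {
    "기계가스설비공사업": 121,
    "금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업": 65,
    "상ㆍ하수도설비공사업": 61,
    "지반조성ㆍ포장공사업": 58,
    "실내건축공사업": 43,
    "철근ㆍ콘크리트공사업": 40,
    "가스난방공사업": 36,
    "도장ㆍ습식ㆍ방수ㆍ석공사업": 35,
    "조경식재ㆍ시설물공사업": 34,
    "구조물해체ㆍ비계공사업": 28,
    "Other values (4)": 40,
}

total = sum(counts.values())
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

for value, share in shares.items():
    print(f"{value}\t{counts[value]}\t{share}%")
```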

Length

Histogram of lengths of the category

영업소재지(도로명주소)
Text

Distinct: 400
Distinct (%): 71.6%
Missing: 2
Missing (%): 0.4%
Memory size: 4.5 KiB

Length

Max length: 54
Median length: 47
Mean length: 30.543828
Min length: 22

Characters and Unicode

Total characters: 17074
Distinct characters: 160
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 296
Unique (%): 53.0%

Sample

1st row: 부산광역시 강서구 공항로811번나길 124 (대저2동)
2nd row: 부산광역시 강서구 공항로239번길 96 (대저2동)
3rd row: 부산광역시 강서구 공항로239번길 96 (대저2동)
4th row: 부산광역시 강서구 대저중앙로348번길 94 (대저1동)
5th row: 부산광역시 강서구 신호산단4로64번길 10 (신호동)
Value | Count | Frequency (%)
부산광역시 | 559 | 18.0%
강서구 | 559 | 18.0%
대저1동 | 135 | 4.4%
대저2동 | 86 | 2.8%
유통단지1로 | 62 | 2.0%
송정동 | 52 | 1.7%
41 | 38 | 1.2%
명지동 | 36 | 1.2%
강동동 | 28 | 0.9%
50 | 26 | 0.8%
Other values (586) | 1517 | 49.0%

Most occurring characters

ValueCountFrequency (%)
2539
 
14.9%
1 893
 
5.2%
813
 
4.8%
750
 
4.4%
606
 
3.5%
2 599
 
3.5%
597
 
3.5%
574
 
3.4%
564
 
3.3%
563
 
3.3%
Other values (150) 8576
50.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9846 | 57.7%
Decimal Number | 3342 | 19.6%
Space Separator | 2539 | 14.9%
Close Punctuation | 561 | 3.3%
Open Punctuation | 561 | 3.3%
Other Punctuation | 143 | 0.8%
Dash Punctuation | 81 | 0.5%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
813
 
8.3%
750
 
7.6%
606
 
6.2%
597
 
6.1%
574
 
5.8%
564
 
5.7%
563
 
5.7%
560
 
5.7%
559
 
5.7%
532
 
5.4%
Other values (132) 3728
37.9%
Decimal Number
ValueCountFrequency (%)
1 893
26.7%
2 599
17.9%
3 378
11.3%
0 271
 
8.1%
5 270
 
8.1%
6 217
 
6.5%
8 206
 
6.2%
4 197
 
5.9%
7 162
 
4.8%
9 149
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 130
90.9%
12
 
8.4%
. 1
 
0.7%
Space Separator
ValueCountFrequency (%)
2539
100.0%
Close Punctuation
ValueCountFrequency (%)
) 561
100.0%
Open Punctuation
ValueCountFrequency (%)
( 561
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 81
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9846 | 57.7%
Common | 7227 | 42.3%
Latin | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
813
 
8.3%
750
 
7.6%
606
 
6.2%
597
 
6.1%
574
 
5.8%
564
 
5.7%
563
 
5.7%
560
 
5.7%
559
 
5.7%
532
 
5.4%
Other values (132) 3728
37.9%
Common
ValueCountFrequency (%)
2539
35.1%
1 893
 
12.4%
2 599
 
8.3%
) 561
 
7.8%
( 561
 
7.8%
3 378
 
5.2%
0 271
 
3.7%
5 270
 
3.7%
6 217
 
3.0%
8 206
 
2.9%
Other values (7) 732
 
10.1%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9846 | 57.7%
ASCII | 7216 | 42.3%
None | 12 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2539
35.2%
1 893
 
12.4%
2 599
 
8.3%
) 561
 
7.8%
( 561
 
7.8%
3 378
 
5.2%
0 271
 
3.8%
5 270
 
3.7%
6 217
 
3.0%
8 206
 
2.9%
Other values (7) 721
 
10.0%
Hangul
ValueCountFrequency (%)
813
 
8.3%
750
 
7.6%
606
 
6.2%
597
 
6.1%
574
 
5.8%
564
 
5.7%
563
 
5.7%
560
 
5.7%
559
 
5.7%
532
 
5.4%
Other values (132) 3728
37.9%
None
ValueCountFrequency (%)
12
100.0%

Interactions


Correlations

Variable | 연번 | 업종
연번 | 1.000 | 0.215
업종 | 0.215 | 1.000

Variable | 연번 | 업종
연번 | 1.000 | 0.085
업종 | 0.085 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 상호 | 업종 | 영업소재지(도로명주소)
0 | 1 | (유)금문건설 | 상ㆍ하수도설비공사업 | 부산광역시 강서구 공항로811번나길 124 (대저2동)
1 | 2 | (유)한음이엔지 | 시설물유지관리업 | 부산광역시 강서구 공항로239번길 96 (대저2동)
2 | 3 | (유)한음이엔지 | 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 부산광역시 강서구 공항로239번길 96 (대저2동)
3 | 4 | (주)가람테크 | 기계가스설비공사업 | 부산광역시 강서구 대저중앙로348번길 94 (대저1동)
4 | 5 | (주)가안이엔씨 | 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 부산광역시 강서구 신호산단4로64번길 10 (신호동)
5 | 6 | (주)가야건설산업 | 철근ㆍ콘크리트공사업 | 부산광역시 강서구 유통단지1로 50, 202동 202호(대저2동, 부산티플렉스)
6 | 7 | (주)가원조경 | 조경식재ㆍ시설물공사업 | 부산광역시 강서구 공항로1309번길 76 (대저1동)
7 | 8 | (주)가진건설 | 철근ㆍ콘크리트공사업 | 부산광역시 강서구 동선길 99-8 (동선동)
8 | 9 | (주)개림산업 | 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 부산광역시 강서구 울만로25번길 103-9 (대저2동)
9 | 10 | (주)거도산업 | 실내건축공사업 | 부산광역시 강서구 신호산단1로 215, 705호(신호동, 새미래오피스빌딩)

(index) | 연번 | 상호 | 업종 | 영업소재지(도로명주소)
551 | 552 | 한일방식(주) | 수중ㆍ준설공사업 | 부산광역시 강서구 미음산단3로 82 (미음동)
552 | 553 | 해진설비 | 가스난방공사업 | 부산광역시 강서구 낙동북로 228(대저1동)
553 | 554 | 형덕산업개발(주) | 철근ㆍ콘크리트공사업 | 부산광역시 강서구 명지국제2로28번길 3, 903호(명지동, 동건프라자)
554 | 555 | 형덕산업개발(주) | 지반조성ㆍ포장공사업 | 부산광역시 강서구 명지국제2로28번길 3, 903호(명지동, 동건프라자)
555 | 556 | 호인건설(주) | 상ㆍ하수도설비공사업 | 부산광역시 강서구 호계로 133, 2층 (죽동동)
556 | 557 | 호인건설(주) | 기계가스설비공사업 | 부산광역시 강서구 호계로 133, 2층 (죽동동)
557 | 558 | 화복공작소주식회사 | 실내건축공사업 | 부산광역시 강서구 미음산단4로 190 2층 (미음동)
558 | 559 | 효원산업 | 실내건축공사업 | 부산광역시 강서구 녹산산단381로 13 (송정동)
559 | 560 | 효원산업 | 시설물유지관리업 | 부산광역시 강서구 녹산산단381로 13 (송정동)
560 | 561 | 흥국이엔씨(E&C) | 가스난방공사업 | 부산광역시 강서구 제도로 823(강동동)