Overview

Dataset statistics

Number of variables5
Number of observations1681
Missing cells14
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory67.4 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description경상남도에 등록한 종합건설업 업체현황입니다. 경상남도 종합건설업 업체에 대한 데이터로 업종, 업체명, 전화번호, 주소의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/3076024/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:33:59.509273
Analysis finished2023-12-12 04:34:00.611402
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct1681
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean841
Minimum1
Maximum1681
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.9 KiB
2023-12-12T13:34:00.697730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile85
Q1421
median841
Q31261
95-th percentile1597
Maximum1681
Range1680
Interquartile range (IQR)840

Descriptive statistics

Standard deviation485.40722
Coefficient of variation (CV)0.57717862
Kurtosis-1.2
Mean841
Median Absolute Deviation (MAD)420
Skewness0
Sum1413721
Variance235620.17
MonotonicityStrictly increasing
2023-12-12T13:34:00.871528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1105 1
 
0.1%
1129 1
 
0.1%
1128 1
 
0.1%
1127 1
 
0.1%
1126 1
 
0.1%
1125 1
 
0.1%
1124 1
 
0.1%
1123 1
 
0.1%
1122 1
 
0.1%
Other values (1671) 1671
99.4%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1681 1
0.1%
1680 1
0.1%
1679 1
0.1%
1678 1
0.1%
1677 1
0.1%
1676 1
0.1%
1675 1
0.1%
1674 1
0.1%
1673 1
0.1%
1672 1
0.1%

업종
Categorical

Distinct6
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.3 KiB
건축공사업
690 
토목공사업
566 
토목건축공사업
257 
조경공사업
141 
산업ㆍ환경설비공사업
 
21

Length

Max length10
Median length5
Mean length5.3753718
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토목공사업
2nd row토목공사업
3rd row토목공사업
4th row건축공사업
5th row토목공사업

Common Values

ValueCountFrequency (%)
건축공사업 690
41.0%
토목공사업 566
33.7%
토목건축공사업 257
 
15.3%
조경공사업 141
 
8.4%
산업ㆍ환경설비공사업 21
 
1.2%
산업설비공사업 6
 
0.4%

Length

2023-12-12T13:34:01.030490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:34:01.174750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건축공사업 690
41.0%
토목공사업 566
33.7%
토목건축공사업 257
 
15.3%
조경공사업 141
 
8.4%
산업ㆍ환경설비공사업 21
 
1.2%
산업설비공사업 6
 
0.4%
Distinct1338
Distinct (%)79.6%
Missing0
Missing (%)0.0%
Memory size13.3 KiB
2023-12-12T13:34:01.424931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length7.9607377
Min length4

Characters and Unicode

Total characters13382
Distinct characters304
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1040 ?
Unique (%)61.9%

Sample

1st row(유)거성건설
2nd row(유)계영
3rd row(유)대산건설
4th row(유)대양씨앤씨
5th row(유)대흥종합건설
ValueCountFrequency (%)
정우종합건설(주 5
 
0.3%
정우건설(주 4
 
0.2%
삼원종합건설(주 4
 
0.2%
도원종합건설(주 4
 
0.2%
정인건설(주 3
 
0.2%
일성종합건설(주 3
 
0.2%
관보토건(주 3
 
0.2%
정진종합건설(주 3
 
0.2%
서진산업(주 3
 
0.2%
남명건설(주 3
 
0.2%
Other values (1328) 1646
97.9%
2023-12-12T13:34:01.833058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1670
 
12.5%
) 1624
 
12.1%
( 1624
 
12.1%
1329
 
9.9%
1206
 
9.0%
652
 
4.9%
649
 
4.8%
154
 
1.2%
141
 
1.1%
137
 
1.0%
Other values (294) 4196
31.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10124
75.7%
Close Punctuation 1624
 
12.1%
Open Punctuation 1624
 
12.1%
Other Symbol 5
 
< 0.1%
Other Punctuation 3
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1670
16.5%
1329
 
13.1%
1206
 
11.9%
652
 
6.4%
649
 
6.4%
154
 
1.5%
141
 
1.4%
137
 
1.4%
137
 
1.4%
106
 
1.0%
Other values (287) 3943
38.9%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
L 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 1624
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1624
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10129
75.7%
Common 3251
 
24.3%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1670
16.5%
1329
 
13.1%
1206
 
11.9%
652
 
6.4%
649
 
6.4%
154
 
1.5%
141
 
1.4%
137
 
1.4%
137
 
1.4%
106
 
1.0%
Other values (288) 3948
39.0%
Common
ValueCountFrequency (%)
) 1624
50.0%
( 1624
50.0%
. 2
 
0.1%
& 1
 
< 0.1%
Latin
ValueCountFrequency (%)
S 1
50.0%
L 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10124
75.7%
ASCII 3253
 
24.3%
None 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1670
16.5%
1329
 
13.1%
1206
 
11.9%
652
 
6.4%
649
 
6.4%
154
 
1.5%
141
 
1.4%
137
 
1.4%
137
 
1.4%
106
 
1.0%
Other values (287) 3943
38.9%
ASCII
ValueCountFrequency (%)
) 1624
49.9%
( 1624
49.9%
. 2
 
0.1%
S 1
 
< 0.1%
L 1
 
< 0.1%
& 1
 
< 0.1%
None
ValueCountFrequency (%)
5
100.0%
Distinct1348
Distinct (%)80.5%
Missing6
Missing (%)0.4%
Memory size13.3 KiB
2023-12-12T13:34:02.170142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.009552
Min length11

Characters and Unicode

Total characters20116
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1057 ?
Unique (%)63.1%

Sample

1st row055-649-9550
2nd row055-931-0415
3rd row055-761-9820
4th row055-298-9556
5th row055-326-6868
ValueCountFrequency (%)
055-267-4005 5
 
0.3%
055-762-7070 4
 
0.2%
055-963-9336 4
 
0.2%
055-585-8689 4
 
0.2%
055-884-1578 4
 
0.2%
055-931-0091 3
 
0.2%
055-266-1661 3
 
0.2%
055-223-8665 3
 
0.2%
055-325-1961 3
 
0.2%
055-243-3390 3
 
0.2%
Other values (1338) 1639
97.9%
2023-12-12T13:34:02.655384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 4443
22.1%
- 3350
16.7%
0 2883
14.3%
3 1419
 
7.1%
7 1348
 
6.7%
2 1328
 
6.6%
4 1157
 
5.8%
8 1131
 
5.6%
6 1093
 
5.4%
1 1058
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16766
83.3%
Dash Punctuation 3350
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 4443
26.5%
0 2883
17.2%
3 1419
 
8.5%
7 1348
 
8.0%
2 1328
 
7.9%
4 1157
 
6.9%
8 1131
 
6.7%
6 1093
 
6.5%
1 1058
 
6.3%
9 906
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 3350
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20116
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 4443
22.1%
- 3350
16.7%
0 2883
14.3%
3 1419
 
7.1%
7 1348
 
6.7%
2 1328
 
6.6%
4 1157
 
5.8%
8 1131
 
5.6%
6 1093
 
5.4%
1 1058
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20116
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 4443
22.1%
- 3350
16.7%
0 2883
14.3%
3 1419
 
7.1%
7 1348
 
6.7%
2 1328
 
6.6%
4 1157
 
5.8%
8 1131
 
5.6%
6 1093
 
5.4%
1 1058
 
5.3%
Distinct1353
Distinct (%)80.9%
Missing8
Missing (%)0.5%
Memory size13.3 KiB
2023-12-12T13:34:03.142829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length47
Mean length29.111775
Min length18

Characters and Unicode

Total characters48704
Distinct characters403
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1066 ?
Unique (%)63.7%

Sample

1st row경상남도 통영시 동호안길 86 1동 402호, 동원비치맨션상가 (동호동)
2nd row경상남도 합천군 대병면 서부로 1779-2
3rd row경상남도 함양군 함양읍 삼휴길 29-7
4th row경상남도 창원시 의창구 의안로12번길 44 (소답동)
5th row경상남도 김해시 평전로171번길 6 (내동)
ValueCountFrequency (%)
경상남도 1667
 
16.4%
창원시 420
 
4.1%
진주시 273
 
2.7%
김해시 187
 
1.8%
179
 
1.8%
2층 172
 
1.7%
의창구 141
 
1.4%
성산구 126
 
1.2%
양산시 96
 
0.9%
3층 84
 
0.8%
Other values (2401) 6804
67.0%
2023-12-12T13:34:03.755033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8476
 
17.4%
1930
 
4.0%
1890
 
3.9%
1757
 
3.6%
1718
 
3.5%
1 1664
 
3.4%
1394
 
2.9%
2 1347
 
2.8%
1312
 
2.7%
1268
 
2.6%
Other values (393) 25948
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28647
58.8%
Space Separator 8476
 
17.4%
Decimal Number 7932
 
16.3%
Other Punctuation 1148
 
2.4%
Close Punctuation 1051
 
2.2%
Open Punctuation 1049
 
2.2%
Dash Punctuation 398
 
0.8%
Uppercase Letter 2
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1930
 
6.7%
1890
 
6.6%
1757
 
6.1%
1718
 
6.0%
1394
 
4.9%
1312
 
4.6%
1268
 
4.4%
824
 
2.9%
754
 
2.6%
647
 
2.3%
Other values (372) 15153
52.9%
Decimal Number
ValueCountFrequency (%)
1 1664
21.0%
2 1347
17.0%
0 882
11.1%
3 859
10.8%
4 718
9.1%
5 642
 
8.1%
6 553
 
7.0%
7 472
 
6.0%
9 400
 
5.0%
8 395
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 1066
92.9%
68
 
5.9%
· 11
 
1.0%
. 3
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
8476
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1051
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1049
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 398
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28647
58.8%
Common 20054
41.2%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1930
 
6.7%
1890
 
6.6%
1757
 
6.1%
1718
 
6.0%
1394
 
4.9%
1312
 
4.6%
1268
 
4.4%
824
 
2.9%
754
 
2.6%
647
 
2.3%
Other values (372) 15153
52.9%
Common
ValueCountFrequency (%)
8476
42.3%
1 1664
 
8.3%
2 1347
 
6.7%
, 1066
 
5.3%
) 1051
 
5.2%
( 1049
 
5.2%
0 882
 
4.4%
3 859
 
4.3%
4 718
 
3.6%
5 642
 
3.2%
Other values (8) 2300
 
11.5%
Latin
ValueCountFrequency (%)
T 1
33.3%
B 1
33.3%
e 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28647
58.8%
ASCII 19978
41.0%
None 79
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8476
42.4%
1 1664
 
8.3%
2 1347
 
6.7%
, 1066
 
5.3%
) 1051
 
5.3%
( 1049
 
5.3%
0 882
 
4.4%
3 859
 
4.3%
4 718
 
3.6%
5 642
 
3.2%
Other values (9) 2224
 
11.1%
Hangul
ValueCountFrequency (%)
1930
 
6.7%
1890
 
6.6%
1757
 
6.1%
1718
 
6.0%
1394
 
4.9%
1312
 
4.6%
1268
 
4.4%
824
 
2.9%
754
 
2.6%
647
 
2.3%
Other values (372) 15153
52.9%
None
ValueCountFrequency (%)
68
86.1%
· 11
 
13.9%

Interactions

2023-12-12T13:34:00.062353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:34:03.855893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종
번호1.0000.108
업종0.1081.000
2023-12-12T13:34:03.962674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종
번호1.0000.057
업종0.0571.000

Missing values

2023-12-12T13:34:00.250213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:34:00.411176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T13:34:00.555188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호업종업체명전화번호도로명주소
01토목공사업(유)거성건설055-649-9550경상남도 통영시 동호안길 86 1동 402호, 동원비치맨션상가 (동호동)
12토목공사업(유)계영055-931-0415경상남도 합천군 대병면 서부로 1779-2
23토목공사업(유)대산건설055-761-9820경상남도 함양군 함양읍 삼휴길 29-7
34건축공사업(유)대양씨앤씨055-298-9556경상남도 창원시 의창구 의안로12번길 44 (소답동)
45토목공사업(유)대흥종합건설055-326-6868경상남도 김해시 평전로171번길 6 (내동)
56토목공사업(유)동남055-884-7220경상남도 하동군 하동읍 중앙로 99
67조경공사업(유)동영건설055-672-2233경상남도 고성군 상리면 망봉로 127
78토목건축공사업(유)동우종합건설055-854-9166경상남도 사천시 사천읍 수양로 123-2
89토목공사업(유)두경건설055-852-9955경상남도 사천시 서포면 구평1로 67
910토목건축공사업(유)비룡종합건설055-963-3385경상남도 함양군 함양읍 함양초등길 35-3
번호업종업체명전화번호도로명주소
16711672조경공사업효성종합건설(주)055-753-8808경상남도 진주시 돗골로 3, 3층(상평동)
16721673건축공사업효원종합건설(주)055-256-8500경상남도 창원시 의창구 읍성로101번길 6, 2층(소답동)
16731674토목공사업효진종합건설(주)055-931-9953경상남도 합천군 합천읍 장수로 66
16741675토목공사업흥창건설(주)055-759-5902경상남도 진주시 하대로 85, 202호(하대동, 럭스빌)
16751676건축공사업흥창건설(주)055-759-5902경상남도 진주시 하대로 85, 202호(하대동, 럭스빌)
16761677토목건축공사업흥한건설(주)055-752-5551경상남도 진주시 진양호로 544, 2층, 3층(동성동)
16771678토목건축공사업흥한산업(주)055-757-7100경상남도 진주시 영천강로 172 , 상가 101호 (충무공동, 트렌젠웰가)
16781679토목건축공사업흥한주택종합건설(주)055-742-0001경상남도 진주시 신안로 94, 105호(신안동, 흥한스위트)
16791680토목공사업희선종합건설(주)055-247-4648경상남도 창원시 마산합포구 중앙동로 20, 2층(중앙동2가)
16801681건축공사업힘찬종합건설(주)055-344-2234경상남도 김해시 주촌면 선지로101번길 20