Overview

Dataset statistics

Number of variables5
Number of observations213
Missing cells3
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.4 KiB
Average record size in memory40.6 B

Variable types

Categorical1
Text4

Dataset

Description경상남도 내 측량업체 등록 현황입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15056437

Alerts

사무소전화번호 has 3 (1.4%) missing valuesMissing
업등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:19:49.150906
Analysis finished2023-12-11 00:19:49.574701
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
일반측량
159 
공공측량
40 
지적측량
 
14

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지적측량
2nd row지적측량
3rd row지적측량
4th row지적측량
5th row지적측량

Common Values

ValueCountFrequency (%)
일반측량 159
74.6%
공공측량 40
 
18.8%
지적측량 14
 
6.6%

Length

2023-12-11T09:19:49.646066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:19:49.760547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반측량 159
74.6%
공공측량 40
 
18.8%
지적측량 14
 
6.6%

업등록번호
Text

UNIQUE 

Distinct213
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-11T09:19:50.049040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length7.8028169
Min length2

Characters and Unicode

Total characters1662
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique213 ?
Unique (%)100.0%

Sample

1st row02-000311
2nd row02-000312
3rd row02-000281
4th row4800007
5th row02-000266
ValueCountFrequency (%)
02-000311 1
 
0.5%
6042097 1
 
0.5%
04-003837 1
 
0.5%
04-003365 1
 
0.5%
6012052 1
 
0.5%
04-004256 1
 
0.5%
04-003994 1
 
0.5%
04-004290 1
 
0.5%
04-003719 1
 
0.5%
04-003706 1
 
0.5%
Other values (203) 203
95.3%
2023-12-11T09:19:50.466695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 548
33.0%
4 227
13.7%
2 153
 
9.2%
6 134
 
8.1%
3 130
 
7.8%
1 120
 
7.2%
- 115
 
6.9%
7 61
 
3.7%
9 58
 
3.5%
8 54
 
3.2%
Other values (4) 62
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1539
92.6%
Dash Punctuation 115
 
6.9%
Other Letter 8
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 548
35.6%
4 227
14.7%
2 153
 
9.9%
6 134
 
8.7%
3 130
 
8.4%
1 120
 
7.8%
7 61
 
4.0%
9 58
 
3.8%
8 54
 
3.5%
5 54
 
3.5%
Other Letter
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 115
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1654
99.5%
Hangul 8
 
0.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 548
33.1%
4 227
13.7%
2 153
 
9.3%
6 134
 
8.1%
3 130
 
7.9%
1 120
 
7.3%
- 115
 
7.0%
7 61
 
3.7%
9 58
 
3.5%
8 54
 
3.3%
Hangul
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1654
99.5%
Hangul 8
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 548
33.1%
4 227
13.7%
2 153
 
9.3%
6 134
 
8.1%
3 130
 
7.9%
1 120
 
7.3%
- 115
 
7.0%
7 61
 
3.7%
9 58
 
3.5%
8 54
 
3.3%
Hangul
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%
Distinct204
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-11T09:19:50.670995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length8.600939
Min length2

Characters and Unicode

Total characters1832
Distinct characters172
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique195 ?
Unique (%)91.5%

Sample

1st row(주)동영기술단
2nd row(주)우리이엔지건축사사무소
3rd row(주)우신측량토목공사
4th rowSM측량공사
5th row㈜뉴비전네트웍스
ValueCountFrequency (%)
주식회사 56
 
20.7%
주)동영기술단 2
 
0.7%
주)우리이엔지건축사사무소 2
 
0.7%
대양기술단 2
 
0.7%
주)세진 2
 
0.7%
보금기술공사 2
 
0.7%
한성개발공사 2
 
0.7%
주식회사민종합기술단 2
 
0.7%
주)우신측량토목공사 2
 
0.7%
우주종합이엔지 2
 
0.7%
Other values (197) 197
72.7%
2023-12-11T09:19:50.979403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
171
 
9.3%
( 109
 
5.9%
) 109
 
5.9%
99
 
5.4%
80
 
4.4%
80
 
4.4%
62
 
3.4%
60
 
3.3%
59
 
3.2%
55
 
3.0%
Other values (162) 948
51.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1535
83.8%
Open Punctuation 109
 
5.9%
Close Punctuation 109
 
5.9%
Space Separator 59
 
3.2%
Uppercase Letter 13
 
0.7%
Other Symbol 3
 
0.2%
Other Punctuation 2
 
0.1%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
 
11.1%
99
 
6.4%
80
 
5.2%
80
 
5.2%
62
 
4.0%
60
 
3.9%
55
 
3.6%
54
 
3.5%
54
 
3.5%
38
 
2.5%
Other values (147) 782
50.9%
Uppercase Letter
ValueCountFrequency (%)
E 4
30.8%
N 2
15.4%
G 2
15.4%
S 2
15.4%
C 1
 
7.7%
M 1
 
7.7%
L 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
, 1
50.0%
Lowercase Letter
ValueCountFrequency (%)
g 1
50.0%
n 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 109
100.0%
Close Punctuation
ValueCountFrequency (%)
) 109
100.0%
Space Separator
ValueCountFrequency (%)
59
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1538
84.0%
Common 279
 
15.2%
Latin 15
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
 
11.1%
99
 
6.4%
80
 
5.2%
80
 
5.2%
62
 
4.0%
60
 
3.9%
55
 
3.6%
54
 
3.5%
54
 
3.5%
38
 
2.5%
Other values (148) 785
51.0%
Latin
ValueCountFrequency (%)
E 4
26.7%
N 2
13.3%
G 2
13.3%
S 2
13.3%
C 1
 
6.7%
g 1
 
6.7%
n 1
 
6.7%
M 1
 
6.7%
L 1
 
6.7%
Common
ValueCountFrequency (%)
( 109
39.1%
) 109
39.1%
59
21.1%
& 1
 
0.4%
, 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1535
83.8%
ASCII 294
 
16.0%
None 3
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
171
 
11.1%
99
 
6.4%
80
 
5.2%
80
 
5.2%
62
 
4.0%
60
 
3.9%
55
 
3.6%
54
 
3.5%
54
 
3.5%
38
 
2.5%
Other values (147) 782
50.9%
ASCII
ValueCountFrequency (%)
( 109
37.1%
) 109
37.1%
59
20.1%
E 4
 
1.4%
N 2
 
0.7%
G 2
 
0.7%
S 2
 
0.7%
C 1
 
0.3%
& 1
 
0.3%
, 1
 
0.3%
Other values (4) 4
 
1.4%
None
ValueCountFrequency (%)
3
100.0%

사무소전화번호
Text

MISSING 

Distinct199
Distinct (%)94.8%
Missing3
Missing (%)1.4%
Memory size1.8 KiB
2023-12-11T09:19:51.208894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.028571
Min length11

Characters and Unicode

Total characters2526
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique188 ?
Unique (%)89.5%

Sample

1st row055-356-7667
2nd row055-367-7800
3rd row055-367-9931
4th row055-274-1436
5th row061-285-0181
ValueCountFrequency (%)
055-334-7501 2
 
1.0%
055-384-2507 2
 
1.0%
055-356-7667 2
 
1.0%
055-367-7800 2
 
1.0%
055-903-9934 2
 
1.0%
055-339-6066 2
 
1.0%
055-237-9466 2
 
1.0%
055-365-7500 2
 
1.0%
055-713-1550 2
 
1.0%
055-289-1190 2
 
1.0%
Other values (189) 190
90.5%
2023-12-11T09:19:51.571511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 540
21.4%
- 420
16.6%
0 359
14.2%
3 201
 
8.0%
2 185
 
7.3%
7 167
 
6.6%
6 159
 
6.3%
1 140
 
5.5%
4 123
 
4.9%
9 117
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2106
83.4%
Dash Punctuation 420
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 540
25.6%
0 359
17.0%
3 201
 
9.5%
2 185
 
8.8%
7 167
 
7.9%
6 159
 
7.5%
1 140
 
6.6%
4 123
 
5.8%
9 117
 
5.6%
8 115
 
5.5%
Dash Punctuation
ValueCountFrequency (%)
- 420
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2526
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 540
21.4%
- 420
16.6%
0 359
14.2%
3 201
 
8.0%
2 185
 
7.3%
7 167
 
6.6%
6 159
 
6.3%
1 140
 
5.5%
4 123
 
4.9%
9 117
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2526
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 540
21.4%
- 420
16.6%
0 359
14.2%
3 201
 
8.0%
2 185
 
7.3%
7 167
 
6.6%
6 159
 
6.3%
1 140
 
5.5%
4 123
 
4.9%
9 117
 
4.6%
Distinct205
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-11T09:19:51.887656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length50
Mean length37.084507
Min length23

Characters and Unicode

Total characters7899
Distinct characters244
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)92.5%

Sample

1st row50419 경상남도 밀양시 시청로1길 6, 3층 (내이동)
2nd row50652 경상남도 양산시 물금읍 부산대학로 150, 603호(대한빌딩)
3rd row50653 경상남도 양산시 물금읍 증산역로 149, 7층 703,704호(세영프라자)
4th row51436 경상남도 창원시 의창구 용지로 161,202호 (용호동, 경남빌딩)
5th row50597 경상남도 양산시 물금읍 동중2길 6-27, 1동 414호(에버리치오피스텔)/ 서울 금천구 가산디지털2로 14 707호
ValueCountFrequency (%)
경상남도 213
 
14.4%
창원시 39
 
2.6%
김해시 36
 
2.4%
양산시 30
 
2.0%
의창구 22
 
1.5%
3층 16
 
1.1%
2층 14
 
0.9%
밀양시 13
 
0.9%
성산구 12
 
0.8%
50924 12
 
0.8%
Other values (682) 1069
72.4%
2023-12-11T09:19:52.393325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1516
 
19.2%
5 371
 
4.7%
2 339
 
4.3%
1 335
 
4.2%
0 309
 
3.9%
255
 
3.2%
239
 
3.0%
3 237
 
3.0%
222
 
2.8%
220
 
2.8%
Other values (234) 3856
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3660
46.3%
Decimal Number 2192
27.8%
Space Separator 1516
19.2%
Other Punctuation 148
 
1.9%
Open Punctuation 137
 
1.7%
Close Punctuation 137
 
1.7%
Dash Punctuation 100
 
1.3%
Uppercase Letter 7
 
0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
255
 
7.0%
239
 
6.5%
222
 
6.1%
220
 
6.0%
166
 
4.5%
166
 
4.5%
165
 
4.5%
117
 
3.2%
93
 
2.5%
93
 
2.5%
Other values (210) 1924
52.6%
Decimal Number
ValueCountFrequency (%)
5 371
16.9%
2 339
15.5%
1 335
15.3%
0 309
14.1%
3 237
10.8%
4 185
8.4%
9 130
 
5.9%
6 114
 
5.2%
7 89
 
4.1%
8 83
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
28.6%
S 1
14.3%
B 1
14.3%
J 1
14.3%
K 1
14.3%
T 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 147
99.3%
/ 1
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
t 1
50.0%
Space Separator
ValueCountFrequency (%)
1516
100.0%
Open Punctuation
ValueCountFrequency (%)
( 137
100.0%
Close Punctuation
ValueCountFrequency (%)
) 137
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4230
53.6%
Hangul 3660
46.3%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
255
 
7.0%
239
 
6.5%
222
 
6.1%
220
 
6.0%
166
 
4.5%
166
 
4.5%
165
 
4.5%
117
 
3.2%
93
 
2.5%
93
 
2.5%
Other values (210) 1924
52.6%
Common
ValueCountFrequency (%)
1516
35.8%
5 371
 
8.8%
2 339
 
8.0%
1 335
 
7.9%
0 309
 
7.3%
3 237
 
5.6%
4 185
 
4.4%
, 147
 
3.5%
( 137
 
3.2%
) 137
 
3.2%
Other values (6) 517
 
12.2%
Latin
ValueCountFrequency (%)
A 2
22.2%
S 1
11.1%
B 1
11.1%
J 1
11.1%
K 1
11.1%
T 1
11.1%
k 1
11.1%
t 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4239
53.7%
Hangul 3660
46.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1516
35.8%
5 371
 
8.8%
2 339
 
8.0%
1 335
 
7.9%
0 309
 
7.3%
3 237
 
5.6%
4 185
 
4.4%
, 147
 
3.5%
( 137
 
3.2%
) 137
 
3.2%
Other values (14) 526
 
12.4%
Hangul
ValueCountFrequency (%)
255
 
7.0%
239
 
6.5%
222
 
6.1%
220
 
6.0%
166
 
4.5%
166
 
4.5%
165
 
4.5%
117
 
3.2%
93
 
2.5%
93
 
2.5%
Other values (210) 1924
52.6%

Missing values

2023-12-11T09:19:49.441369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:19:49.538023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업등록번호업체명사무소전화번호사무소도로명주소
0지적측량02-000311(주)동영기술단055-356-766750419 경상남도 밀양시 시청로1길 6, 3층 (내이동)
1지적측량02-000312(주)우리이엔지건축사사무소055-367-780050652 경상남도 양산시 물금읍 부산대학로 150, 603호(대한빌딩)
2지적측량02-000281(주)우신측량토목공사055-367-993150653 경상남도 양산시 물금읍 증산역로 149, 7층 703,704호(세영프라자)
3지적측량4800007SM측량공사055-274-143651436 경상남도 창원시 의창구 용지로 161,202호 (용호동, 경남빌딩)
4지적측량02-000266㈜뉴비전네트웍스061-285-018150597 경상남도 양산시 물금읍 동중2길 6-27, 1동 414호(에버리치오피스텔)/ 서울 금천구 가산디지털2로 14 707호
5지적측량02-000259우주종합이엔지055-365-750050623 경상남도 양산시 옥곡3길 7-0 (남부동)
6지적측량02-000235주식회사 가온측량설계공사055-264-280052806 경상남도 진주시 솔밭로 77 (상평동), 2층 201호
7지적측량10주식회사 대양기술단055-289-119051442 경상남도 창원시 의창구 신월로 42, 727호 (신월동, 토월복합상가)
8지적측량02-000248주식회사 라인지적측량055-266-355651526 경상남도 창원시 성산구 삼동로128번길 85-0 303호(내동, 엑스포상가)
9지적측량4800008주식회사 랜드에이스055-287-702651526 경상남도 창원시 성산구 삼동로128번길 85,203호 (내동, 엑스포상가)
업종업등록번호업체명사무소전화번호사무소도로명주소
203일반측량04-003202창녕토목설계공사055-532-045950317 경상남도 창녕군 창녕읍 군청길 4, 2층
204일반측량6042113천명기술단055-322-034450930 경상남도 김해시 인제로39번길 20-0 302호(삼정동)
205일반측량04-003224태강토목055-673-306752943 경상남도 고성군 고성읍 성내로 129-0 101(행운빌딩)
206일반측량04-004139태산E&C<NA>52539 경상남도 사천시 용현면 용현3길 44, 404호(신명빌딩)
207일반측량04-002811태성이엔지055-323-987750937 경상남도 김해시 식만로348번길 25-1
208일반측량04-004047하루055-746-030952649 경상남도 진주시 진양호로97번길 19-5, 3층 (평거동)
209일반측량04-002819한맥기술공사(주)055-329-291850914 경상남도 김해시 분성로 371-5, 3층
210일반측량04-003763한양토목설계055-863-135352425 경상남도 남해군 남해읍 망운로9번길 14-0
211일반측량04-003397해들강토목설계055-974-999152225 경상남도 산청군 산청읍 중앙로 119-0
212일반측량04-004098혜동토목설계055-326-742450924 경상남도 김해시 호계로422번길 20, 2층(부원동)