Overview

Dataset statistics

Number of variables5
Number of observations259
Missing cells6
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.2 KiB
Average record size in memory40.5 B

Variable types

Categorical1
Text4

Dataset

Description경상남도 내 측량업체 등록 현황입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15056437

Alerts

사무소전화번호 has 6 (2.3%) missing valuesMissing
업등록번호 has unique valuesUnique

Reproduction

Analysis started2024-04-18 08:54:09.362747
Analysis finished2024-04-18 08:54:11.058860
Duration1.7 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
일반측량
181 
공공측량
60 
지적측량
 
18

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지적측량
2nd row지적측량
3rd row지적측량
4th row지적측량
5th row지적측량

Common Values

ValueCountFrequency (%)
일반측량 181
69.9%
공공측량 60
 
23.2%
지적측량 18
 
6.9%

Length

2024-04-18T17:54:11.215853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T17:54:11.475187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반측량 181
69.9%
공공측량 60
 
23.2%
지적측량 18
 
6.9%

업등록번호
Text

UNIQUE 

Distinct259
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-18T17:54:12.221910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters2331
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique259 ?
Unique (%)100.0%

Sample

1st row02-000126
2nd row02-000116
3rd row02-000311
4th row02-000372
5th row02-000259
ValueCountFrequency (%)
02-000126 1
 
0.4%
04-002526 1
 
0.4%
04-004720 1
 
0.4%
04-003911 1
 
0.4%
04-004663 1
 
0.4%
04-002185 1
 
0.4%
04-003395 1
 
0.4%
04-003397 1
 
0.4%
04-004905 1
 
0.4%
04-002152 1
 
0.4%
Other values (249) 249
96.1%
2024-04-18T17:54:13.522128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 898
38.5%
4 298
 
12.8%
- 259
 
11.1%
2 195
 
8.4%
3 182
 
7.8%
1 150
 
6.4%
5 106
 
4.5%
6 69
 
3.0%
7 63
 
2.7%
8 56
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2072
88.9%
Dash Punctuation 259
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 898
43.3%
4 298
 
14.4%
2 195
 
9.4%
3 182
 
8.8%
1 150
 
7.2%
5 106
 
5.1%
6 69
 
3.3%
7 63
 
3.0%
8 56
 
2.7%
9 55
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 259
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2331
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 898
38.5%
4 298
 
12.8%
- 259
 
11.1%
2 195
 
8.4%
3 182
 
7.8%
1 150
 
6.4%
5 106
 
4.5%
6 69
 
3.0%
7 63
 
2.7%
8 56
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2331
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 898
38.5%
4 298
 
12.8%
- 259
 
11.1%
2 195
 
8.4%
3 182
 
7.8%
1 150
 
6.4%
5 106
 
4.5%
6 69
 
3.0%
7 63
 
2.7%
8 56
 
2.4%
Distinct244
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-18T17:54:14.241841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length8.8803089
Min length2

Characters and Unicode

Total characters2300
Distinct characters185
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)88.8%

Sample

1st row주식회사 한성개발공사
2nd row주식회사한성기술단
3rd row(주)동영기술단
4th row(주)하나서베이마스터
5th row우주종합이엔지
ValueCountFrequency (%)
주식회사 89
 
25.0%
4
 
1.1%
보금기술공사 3
 
0.8%
주)메타이엔지 2
 
0.6%
씨케이이앤씨 2
 
0.6%
한성개발공사 2
 
0.6%
주)한토공간기술 2
 
0.6%
이도 2
 
0.6%
주)태일 2
 
0.6%
민종합기술단 2
 
0.6%
Other values (240) 246
69.1%
2024-04-18T17:54:15.410274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
209
 
9.1%
141
 
6.1%
( 116
 
5.0%
) 116
 
5.0%
97
 
4.2%
95
 
4.1%
93
 
4.0%
91
 
4.0%
90
 
3.9%
67
 
2.9%
Other values (175) 1185
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1952
84.9%
Open Punctuation 116
 
5.0%
Close Punctuation 116
 
5.0%
Space Separator 97
 
4.2%
Uppercase Letter 14
 
0.6%
Other Symbol 2
 
0.1%
Lowercase Letter 2
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
209
 
10.7%
141
 
7.2%
95
 
4.9%
93
 
4.8%
91
 
4.7%
90
 
4.6%
67
 
3.4%
57
 
2.9%
56
 
2.9%
56
 
2.9%
Other values (162) 997
51.1%
Uppercase Letter
ValueCountFrequency (%)
E 4
28.6%
N 3
21.4%
G 3
21.4%
S 2
14.3%
M 1
 
7.1%
L 1
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
n 1
50.0%
g 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 116
100.0%
Space Separator
ValueCountFrequency (%)
97
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1954
85.0%
Common 330
 
14.3%
Latin 16
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
209
 
10.7%
141
 
7.2%
95
 
4.9%
93
 
4.8%
91
 
4.7%
90
 
4.6%
67
 
3.4%
57
 
2.9%
56
 
2.9%
56
 
2.9%
Other values (163) 999
51.1%
Latin
ValueCountFrequency (%)
E 4
25.0%
N 3
18.8%
G 3
18.8%
S 2
12.5%
M 1
 
6.2%
L 1
 
6.2%
n 1
 
6.2%
g 1
 
6.2%
Common
ValueCountFrequency (%)
( 116
35.2%
) 116
35.2%
97
29.4%
, 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1952
84.9%
ASCII 346
 
15.0%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
209
 
10.7%
141
 
7.2%
95
 
4.9%
93
 
4.8%
91
 
4.7%
90
 
4.6%
67
 
3.4%
57
 
2.9%
56
 
2.9%
56
 
2.9%
Other values (162) 997
51.1%
ASCII
ValueCountFrequency (%)
( 116
33.5%
) 116
33.5%
97
28.0%
E 4
 
1.2%
N 3
 
0.9%
G 3
 
0.9%
S 2
 
0.6%
M 1
 
0.3%
, 1
 
0.3%
L 1
 
0.3%
Other values (2) 2
 
0.6%
None
ValueCountFrequency (%)
2
100.0%

사무소전화번호
Text

MISSING 

Distinct231
Distinct (%)91.3%
Missing6
Missing (%)2.3%
Memory size2.2 KiB
2024-04-18T17:54:16.000858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.059289
Min length12

Characters and Unicode

Total characters3051
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique210 ?
Unique (%)83.0%

Sample

1st row055-334-7501
2nd row055-334-2824
3rd row055-356-7667
4th row055-723-3015
5th row055-365-7500
ValueCountFrequency (%)
055-352-9400 3
 
1.2%
055-673-6880 2
 
0.8%
055-277-6003 2
 
0.8%
055-334-7501 2
 
0.8%
055-288-8886 2
 
0.8%
055-289-1190 2
 
0.8%
055-713-1550 2
 
0.8%
070-4048-7887 2
 
0.8%
053-555-1424 2
 
0.8%
055-314-8228 2
 
0.8%
Other values (221) 232
91.7%
2024-04-18T17:54:16.873531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 629
20.6%
- 506
16.6%
0 445
14.6%
3 251
 
8.2%
2 203
 
6.7%
7 202
 
6.6%
6 195
 
6.4%
1 162
 
5.3%
8 158
 
5.2%
9 150
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2545
83.4%
Dash Punctuation 506
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 629
24.7%
0 445
17.5%
3 251
 
9.9%
2 203
 
8.0%
7 202
 
7.9%
6 195
 
7.7%
1 162
 
6.4%
8 158
 
6.2%
9 150
 
5.9%
4 150
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 506
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3051
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 629
20.6%
- 506
16.6%
0 445
14.6%
3 251
 
8.2%
2 203
 
6.7%
7 202
 
6.6%
6 195
 
6.4%
1 162
 
5.3%
8 158
 
5.2%
9 150
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3051
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 629
20.6%
- 506
16.6%
0 445
14.6%
3 251
 
8.2%
2 203
 
6.7%
7 202
 
6.6%
6 195
 
6.4%
1 162
 
5.3%
8 158
 
5.2%
9 150
 
4.9%
Distinct247
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-18T17:54:17.697416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length54
Mean length37.656371
Min length19

Characters and Unicode

Total characters9753
Distinct characters250
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique236 ?
Unique (%)91.1%

Sample

1st row50930 경상남도 김해시 분성로 524(어방동)
2nd row50924 경상남도 김해시 김해대로2371번길 8-25,3층
3rd row경상남도 밀양시 시청로1길 6, 3층(내이동) 우)50419
4th row경상남도 김해시 김해대로2453번길 27 (삼정동),1층 우)50934
5th row경상남도 양산시 물금읍 백호로 643층 306호(센텀시티프라자) 우)50613
ValueCountFrequency (%)
경상남도 259
 
16.2%
창원시 49
 
3.1%
김해시 40
 
2.5%
양산시 29
 
1.8%
밀양시 24
 
1.5%
성산구 21
 
1.3%
의창구 17
 
1.1%
진주시 14
 
0.9%
우)50419 13
 
0.8%
물금읍 12
 
0.7%
Other values (743) 1124
70.2%
2024-04-18T17:54:18.902543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1721
 
17.6%
1 426
 
4.4%
5 386
 
4.0%
) 386
 
4.0%
2 385
 
3.9%
0 334
 
3.4%
301
 
3.1%
279
 
2.9%
267
 
2.7%
267
 
2.7%
Other values (240) 5001
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4714
48.3%
Decimal Number 2454
25.2%
Space Separator 1721
 
17.6%
Close Punctuation 386
 
4.0%
Other Punctuation 197
 
2.0%
Open Punctuation 185
 
1.9%
Dash Punctuation 85
 
0.9%
Uppercase Letter 9
 
0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
301
 
6.4%
279
 
5.9%
267
 
5.7%
267
 
5.7%
223
 
4.7%
212
 
4.5%
207
 
4.4%
204
 
4.3%
138
 
2.9%
135
 
2.9%
Other values (217) 2481
52.6%
Decimal Number
ValueCountFrequency (%)
1 426
17.4%
5 386
15.7%
2 385
15.7%
0 334
13.6%
3 251
10.2%
4 207
8.4%
9 149
 
6.1%
6 136
 
5.5%
8 97
 
4.0%
7 83
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
A 3
33.3%
B 2
22.2%
I 1
 
11.1%
T 1
 
11.1%
S 1
 
11.1%
J 1
 
11.1%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
t 1
50.0%
Space Separator
ValueCountFrequency (%)
1721
100.0%
Close Punctuation
ValueCountFrequency (%)
) 386
100.0%
Other Punctuation
ValueCountFrequency (%)
, 197
100.0%
Open Punctuation
ValueCountFrequency (%)
( 185
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 85
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5028
51.6%
Hangul 4714
48.3%
Latin 11
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
301
 
6.4%
279
 
5.9%
267
 
5.7%
267
 
5.7%
223
 
4.7%
212
 
4.5%
207
 
4.4%
204
 
4.3%
138
 
2.9%
135
 
2.9%
Other values (217) 2481
52.6%
Common
ValueCountFrequency (%)
1721
34.2%
1 426
 
8.5%
5 386
 
7.7%
) 386
 
7.7%
2 385
 
7.7%
0 334
 
6.6%
3 251
 
5.0%
4 207
 
4.1%
, 197
 
3.9%
( 185
 
3.7%
Other values (5) 550
 
10.9%
Latin
ValueCountFrequency (%)
A 3
27.3%
B 2
18.2%
I 1
 
9.1%
T 1
 
9.1%
S 1
 
9.1%
J 1
 
9.1%
k 1
 
9.1%
t 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5039
51.7%
Hangul 4714
48.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1721
34.2%
1 426
 
8.5%
5 386
 
7.7%
) 386
 
7.7%
2 385
 
7.6%
0 334
 
6.6%
3 251
 
5.0%
4 207
 
4.1%
, 197
 
3.9%
( 185
 
3.7%
Other values (13) 561
 
11.1%
Hangul
ValueCountFrequency (%)
301
 
6.4%
279
 
5.9%
267
 
5.7%
267
 
5.7%
223
 
4.7%
212
 
4.5%
207
 
4.4%
204
 
4.3%
138
 
2.9%
135
 
2.9%
Other values (217) 2481
52.6%

Missing values

2024-04-18T17:54:10.958151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업등록번호업체명사무소전화번호사무소도로명주소
0지적측량02-000126주식회사 한성개발공사055-334-750150930 경상남도 김해시 분성로 524(어방동)
1지적측량02-000116주식회사한성기술단055-334-282450924 경상남도 김해시 김해대로2371번길 8-25,3층
2지적측량02-000311(주)동영기술단055-356-7667경상남도 밀양시 시청로1길 6, 3층(내이동) 우)50419
3지적측량02-000372(주)하나서베이마스터055-723-3015경상남도 김해시 김해대로2453번길 27 (삼정동),1층 우)50934
4지적측량02-000259우주종합이엔지055-365-7500경상남도 양산시 물금읍 백호로 643층 306호(센텀시티프라자) 우)50613
5지적측량02-000312(주)우리이엔지건축사사무소055-367-7800경상남도 양산시 물금읍 부산대학로 150,603호(대한빌딩) 우)50652
6지적측량02-000281(주)우신측량토목공사055-367-9931경상남도 양산시 물금읍 증산역로 153209호(정우프라자) 우)50653
7지적측량02-000161(주) 민종합기술단055-384-2507경상남도 양산시 물금읍 청운로 345, 702호(캠퍼스프라자) 우)50611
8지적측량02-000235주식회사 가온측량설계공사055-264-2800경상남도 진주시 도동로248번길 25 (하대동),2층 우)52767
9지적측량02-000255(주)한토공간기술055-790-9501경상남도 진주시 동부로169번길 12 (충무공동),윙스타워 A동 1302호 우)52818
업종업등록번호업체명사무소전화번호사무소도로명주소
249일반측량04-002156(주)모던055-962-1185경상남도 함양군 함양읍 함양로 1122-1, 3층
250일반측량04-002172(주)신원055-964-0491경상남도 함양군 함양읍 함양로 1245
251일반측량04-004433지피에스토목설계사무소055-962-6479경상남도 함양군 함양읍 함양초등길 8
252일반측량04-002221주식회사 이든055-934-1125경상남도 합천군 대양면 동부로 21-10,1층 우)50239
253일반측량04-004480주식회사 수성이엔씨055-286-8762경상남도 합천군 삼가면 삼가중앙1길 39,3층 우)50222
254일반측량04-002182(주)대운이엔씨055-746-9155경상남도 합천군 용주면 공암길 236-1 우)50214
255일반측량04-004199주식회사 태림055-931-7131경상남도 합천군 합천읍 동서로 118-1,1층 우)50238
256일반측량04-003704주식회사 세원055-794-2995경상남도 합천군 합천읍 서산길 29-0
257일반측량04-002253(주)한남055-933-8322경상남도 합천군 합천읍 중앙로 21
258일반측량04-002194(주)지성이엔지055-931-8398경상남도 합천군 합천읍 핫들2로 11-5,2층 우)50231