Overview

Dataset statistics

Number of variables4
Number of observations369
Missing cells0
Missing cells (%)0.0%
Duplicate rows10
Duplicate rows (%)2.7%
Total size in memory11.7 KiB
Average record size in memory32.4 B

Variable types

Categorical1
Text3

Dataset

Description도로명주소 시행에 따른 국민이용 활성화를 위한 안내도 배부 등을 담당하는 안내의 집 운영 현황 및 LX대한지적공사의 지사정보 제공
Author한국국토정보공사
URLhttps://www.data.go.kr/data/15003908/fileData.do

Alerts

Dataset has 10 (2.7%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-16 04:20:32.858280
Analysis finished2024-03-16 04:20:33.190185
Duration0.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

본부별
Categorical

Distinct19
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
경기지역본부
38 
대구경북지역본부
35 
대전세종충남지역본부
31 
광주전남지역본부
29 
울산.경상남도본부
28 
Other values (14)
208 

Length

Max length10
Median length9
Mean length7.2195122
Min length2

Unique

Unique3 ?
Unique (%)0.8%

Sample

1st row본사
2nd row국토정보교육원
3rd row공간정보연구원
4th row서울지역본부
5th row서울지역본부

Common Values

ValueCountFrequency (%)
경기지역본부 38
10.3%
대구경북지역본부 35
9.5%
대전세종충남지역본부 31
 
8.4%
광주전남지역본부 29
 
7.9%
울산.경상남도본부 28
 
7.6%
강원지역본부 28
 
7.6%
경남지역본부 27
 
7.3%
서울지역본부 25
 
6.8%
전북지역본부 23
 
6.2%
경기남부지역본부 22
 
6.0%
Other values (9) 83
22.5%

Length

2024-03-16T13:20:33.277120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기지역본부 38
10.3%
대구경북지역본부 35
9.5%
대전세종충남지역본부 31
 
8.4%
광주전남지역본부 29
 
7.9%
울산.경상남도본부 28
 
7.6%
강원지역본부 28
 
7.6%
경남지역본부 27
 
7.3%
서울지역본부 25
 
6.8%
전북지역본부 23
 
6.2%
경기남부지역본부 22
 
6.0%
Other values (9) 83
22.5%
Distinct280
Distinct (%)75.9%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-16T13:20:33.537885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length5.3550136
Min length3

Characters and Unicode

Total characters1976
Distinct characters152
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)63.7%

Sample

1st row한국국토정보공사
2nd row국토정보교육원
3rd row공간정보연구원
4th row서울지역본부
5th row지적사업처
ValueCountFrequency (%)
공간정보사업처 25
 
6.8%
지적사업처 13
 
3.5%
국토정보사업처 8
 
2.2%
지적공간사업처 3
 
0.8%
사업처 3
 
0.8%
운영지원처 3
 
0.8%
청주서부지사 3
 
0.8%
성남지사 2
 
0.5%
양주지사 2
 
0.5%
안성지사 2
 
0.5%
Other values (271) 306
82.7%
2024-03-16T13:20:33.934361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
350
 
17.7%
321
 
16.2%
67
 
3.4%
56
 
2.8%
55
 
2.8%
48
 
2.4%
42
 
2.1%
41
 
2.1%
40
 
2.0%
35
 
1.8%
Other values (142) 921
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1958
99.1%
Other Punctuation 16
 
0.8%
Decimal Number 1
 
0.1%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
350
 
17.9%
321
 
16.4%
67
 
3.4%
56
 
2.9%
55
 
2.8%
48
 
2.5%
42
 
2.1%
41
 
2.1%
40
 
2.0%
35
 
1.8%
Other values (139) 903
46.1%
Other Punctuation
ValueCountFrequency (%)
. 16
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1958
99.1%
Common 18
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
350
 
17.9%
321
 
16.4%
67
 
3.4%
56
 
2.9%
55
 
2.8%
48
 
2.5%
42
 
2.1%
41
 
2.1%
40
 
2.0%
35
 
1.8%
Other values (139) 903
46.1%
Common
ValueCountFrequency (%)
. 16
88.9%
1 1
 
5.6%
1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1958
99.1%
ASCII 18
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
350
 
17.9%
321
 
16.4%
67
 
3.4%
56
 
2.9%
55
 
2.8%
48
 
2.5%
42
 
2.1%
41
 
2.1%
40
 
2.0%
35
 
1.8%
Other values (139) 903
46.1%
ASCII
ValueCountFrequency (%)
. 16
88.9%
1 1
 
5.6%
1
 
5.6%
Distinct242
Distinct (%)65.6%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-16T13:20:34.270009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length32
Mean length26.826558
Min length18

Characters and Unicode

Total characters9899
Distinct characters229
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique181 ?
Unique (%)49.1%

Sample

1st row(548-70) 전라북도 전주시 덕진구 기지로 120
2nd row(325-22) 충청남도 공주시 사곡면 연수단지길 182
3rd row(553-65) 전라북도 완주군 이서면 지사제2로 42
4th row(060-53) 서울특별시 강남구 언주로 703
5th row(060-53) 서울특별시 강남구 언주로 703
ValueCountFrequency (%)
경기도 70
 
3.6%
경상남도 27
 
1.4%
서울특별시 25
 
1.3%
강원도 25
 
1.3%
경상북도 24
 
1.2%
경남 24
 
1.2%
전라북도 24
 
1.2%
충청북도 21
 
1.1%
전라남도 20
 
1.0%
충청남도 18
 
0.9%
Other values (909) 1689
85.9%
2024-03-16T13:20:34.971450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1598
 
16.1%
1 473
 
4.8%
2 375
 
3.8%
( 369
 
3.7%
) 369
 
3.7%
- 333
 
3.4%
3 314
 
3.2%
5 305
 
3.1%
0 278
 
2.8%
277
 
2.8%
Other values (219) 5208
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4374
44.2%
Decimal Number 2853
28.8%
Space Separator 1598
 
16.1%
Open Punctuation 369
 
3.7%
Close Punctuation 369
 
3.7%
Dash Punctuation 333
 
3.4%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
277
 
6.3%
259
 
5.9%
244
 
5.6%
172
 
3.9%
156
 
3.6%
140
 
3.2%
115
 
2.6%
109
 
2.5%
109
 
2.5%
96
 
2.2%
Other values (203) 2697
61.7%
Decimal Number
ValueCountFrequency (%)
1 473
16.6%
2 375
13.1%
3 314
11.0%
5 305
10.7%
0 278
9.7%
4 275
9.6%
6 255
8.9%
8 220
7.7%
7 210
7.4%
9 148
 
5.2%
Other Punctuation
ValueCountFrequency (%)
· 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
1598
100.0%
Open Punctuation
ValueCountFrequency (%)
( 369
100.0%
Close Punctuation
ValueCountFrequency (%)
) 369
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 333
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5525
55.8%
Hangul 4374
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
277
 
6.3%
259
 
5.9%
244
 
5.6%
172
 
3.9%
156
 
3.6%
140
 
3.2%
115
 
2.6%
109
 
2.5%
109
 
2.5%
96
 
2.2%
Other values (203) 2697
61.7%
Common
ValueCountFrequency (%)
1598
28.9%
1 473
 
8.6%
2 375
 
6.8%
( 369
 
6.7%
) 369
 
6.7%
- 333
 
6.0%
3 314
 
5.7%
5 305
 
5.5%
0 278
 
5.0%
4 275
 
5.0%
Other values (6) 836
15.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5523
55.8%
Hangul 4374
44.2%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1598
28.9%
1 473
 
8.6%
2 375
 
6.8%
( 369
 
6.7%
) 369
 
6.7%
- 333
 
6.0%
3 314
 
5.7%
5 305
 
5.5%
0 278
 
5.0%
4 275
 
5.0%
Other values (5) 834
15.1%
Hangul
ValueCountFrequency (%)
277
 
6.3%
259
 
5.9%
244
 
5.6%
172
 
3.9%
156
 
3.6%
140
 
3.2%
115
 
2.6%
109
 
2.5%
109
 
2.5%
96
 
2.2%
Other values (203) 2697
61.7%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct281
Distinct (%)76.2%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-16T13:20:35.238874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length12.108401
Min length11

Characters and Unicode

Total characters4468
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique225 ?
Unique (%)61.0%

Sample

1st row063-713-1800
2nd row041-401-0200
3rd row063-710-0300
4th row02-6937-2001
5th row02-6937-2020
ValueCountFrequency (%)
051-553-7704 7
 
1.9%
053-714-7851 6
 
1.6%
031-290-4401 6
 
1.6%
042-718-4039 5
 
1.4%
064-801-6600 4
 
1.1%
02-6937-2046 4
 
1.1%
043-710-4200 4
 
1.1%
031-250-0923 4
 
1.1%
033-250-5321 4
 
1.1%
063-240-2740 4
 
1.1%
Other values (271) 321
87.0%
2024-03-16T13:20:35.724854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 911
20.4%
- 737
16.5%
3 440
9.8%
5 417
9.3%
1 364
 
8.1%
7 326
 
7.3%
4 323
 
7.2%
2 290
 
6.5%
8 259
 
5.8%
6 258
 
5.8%
Other values (2) 143
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3724
83.3%
Dash Punctuation 737
 
16.5%
Math Symbol 7
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 911
24.5%
3 440
11.8%
5 417
11.2%
1 364
 
9.8%
7 326
 
8.8%
4 323
 
8.7%
2 290
 
7.8%
8 259
 
7.0%
6 258
 
6.9%
9 136
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 737
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4468
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 911
20.4%
- 737
16.5%
3 440
9.8%
5 417
9.3%
1 364
 
8.1%
7 326
 
7.3%
4 323
 
7.2%
2 290
 
6.5%
8 259
 
5.8%
6 258
 
5.8%
Other values (2) 143
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4468
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 911
20.4%
- 737
16.5%
3 440
9.8%
5 417
9.3%
1 364
 
8.1%
7 326
 
7.3%
4 323
 
7.2%
2 290
 
6.5%
8 259
 
5.8%
6 258
 
5.8%
Other values (2) 143
 
3.2%

Missing values

2024-03-16T13:20:33.084754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:20:33.158636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

본부별지 사 별주 소전화번호
0본사한국국토정보공사(548-70) 전라북도 전주시 덕진구 기지로 120063-713-1800
1국토정보교육원국토정보교육원(325-22) 충청남도 공주시 사곡면 연수단지길 182041-401-0200
2공간정보연구원공간정보연구원(553-65) 전라북도 완주군 이서면 지사제2로 42063-710-0300
3서울지역본부서울지역본부(060-53) 서울특별시 강남구 언주로 70302-6937-2001
4서울지역본부지적사업처(060-53) 서울특별시 강남구 언주로 70302-6937-2020
5서울지역본부공간정보사업처(060-53) 서울특별시 강남구 언주로 70302-6937-2046
6서울지역본부공간정보사업처(060-53) 서울특별시 강남구 언주로 70302-6937-2046
7서울지역본부국토정보사업처(060-53) 서울특별시 강남구 언주로 70302-6937-2046
8서울지역본부서울중부지사(028-50) 서울특별시 성북구 안암동1가 고려대로7다길 802-6937-2330
9서울지역본부서울동부지사(047-99) 서울특별시 성동구 광나루로 320-202-6937-2270
본부별지 사 별주 소전화번호
359경남지역본부산청지사(522-26) 경상남도 산청군 산청읍 웅석봉로62번길 8055-822-5700
360경남지역본부함양지사(500-43) 경상남도 함양군 함양읍 함양남서로 1219055-820-5880
361경남지역본부거창지사(501-40) 경상남도 거창군 거창읍 중앙로 4055-808-5650
362경남지역본부합천지사(502-31) 경상남도 합천군 합천읍 동서로 141-4055-816-8770
363제주지역본부제주지역본부(631-23) 제주특별자치도 제주시 신대로 118064-801-6600
364제주지역본부지적사업처(631-23) 제주특별자치도 제주시 신대로 118064-801-6600
365제주지역본부공간정보사업처(631-23) 제주특별자치도 제주시 신대로 118064-801-6600
366제주지역본부지적공간사업처(631-23) 제주특별자치도 제주시 신대로 118064-801-6600
367제주지역본부제주지사(631-23) 제주특별자치도 제주시 신대로 118064-801-6650
368제주지역본부서귀포지사(635-68) 제주특별자치도 서귀포시 서호중로 71064-801-6730

Duplicate rows

Most frequently occurring

본부별지 사 별주 소전화번호# duplicates
0경기지역본부공간정보사업처(164-88) 경기도 수원시 팔달구 인계로 170031-290-44012
1광주전남지역본부공간정보사업처(619-47) 광주광역시 서구 상무중앙로 102062-714-68052
2대구경북지역본부공간정보사업처(426-20) 대구광역시 달서구 이곡동로 8053-714-78512
3대전세종충남지역본부공간정보사업처(352-62) 대전광역시 서구 문정로48번길 12042-718-40392
4부산울산지역본부공간정보사업처(477-06) 부산광역시 동래구 금강로 77051-553-77042
5서울지역본부공간정보사업처(060-53) 서울특별시 강남구 언주로 70302-6937-20462
6인천지역본부공간정보사업처(215-59) 인천광역시 남동구 인주대로551번길 42032-713-25362
7전북지역본부공간정보사업처(549-16) 전라북도 전주시 덕진구 백제대로 730063-240-27402
8전북지역본부진안장수지사(554-33) 전라북도 진안군 진안읍 성산2길 21-11063-786-59812
9충북지역본부청주서부지사(285-64) 충청북도 청주시 서원구 예체로 149043-261-18502