Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells2961
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Numeric1
Text3
Boolean1

Dataset

Description부산광역시 북구 관내에 있는 U옥외광고물통합관리시스템의 법정동코드 정보로 법정동코드, 법정동명, 존폐여부 등의 항목을 제공하고 있습니다.
Author부산광역시 북구
URLhttps://www.data.go.kr/data/15050089/fileData.do

Alerts

존폐여부 is highly imbalanced (61.8%)Imbalance
has 2961 (29.6%) missing valuesMissing
법정동코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:55:11.419563
Analysis finished2023-12-12 06:55:12.548772
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

법정동코드
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.3623181 × 109
Minimum1.1110101 × 109
Maximum5.013032 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:55:12.624487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1110101 × 109
5-th percentile2.871025 × 109
Q14.273031 × 109
median4.5140134 × 109
Q34.717037 × 109
95-th percentile4.884037 × 109
Maximum5.013032 × 109
Range3.9020219 × 109
Interquartile range (IQR)4.44006 × 108

Descriptive statistics

Standard deviation6.6012956 × 108
Coefficient of variation (CV)0.15132541
Kurtosis10.612056
Mean4.3623181 × 109
Median Absolute Deviation (MAD)2.139972 × 108
Skewness-3.0484431
Sum4.3623181 × 1013
Variance4.3577103 × 1017
MonotonicityNot monotonic
2023-12-12T15:55:12.785436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4127310100 1
 
< 0.1%
4723025029 1
 
< 0.1%
4371025326 1
 
< 0.1%
4572025029 1
 
< 0.1%
4686032028 1
 
< 0.1%
4376040026 1
 
< 0.1%
4889035033 1
 
< 0.1%
4421032026 1
 
< 0.1%
3020010500 1
 
< 0.1%
4684033031 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1111010100 1
< 0.1%
1111010200 1
< 0.1%
1111010300 1
< 0.1%
1111010600 1
< 0.1%
1111010800 1
< 0.1%
1111010900 1
< 0.1%
1111011200 1
< 0.1%
1111011400 1
< 0.1%
1111011500 1
< 0.1%
1111011600 1
< 0.1%
ValueCountFrequency (%)
5013032025 1
< 0.1%
5013032022 1
< 0.1%
5013031030 1
< 0.1%
5013031028 1
< 0.1%
5013031027 1
< 0.1%
5013031026 1
< 0.1%
5013031025 1
< 0.1%
5013031024 1
< 0.1%
5013025927 1
< 0.1%
5013025926 1
< 0.1%
Distinct3136
Distinct (%)31.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T15:55:13.164936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.0064
Min length2

Characters and Unicode

Total characters30064
Distinct characters351
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1694 ?
Unique (%)16.9%

Sample

1st row고잔동
2nd row금산면
3rd row쌍백면
4th row설악면
5th row대창동
ValueCountFrequency (%)
남면 96
 
1.0%
북구 57
 
0.6%
서면 54
 
0.5%
북면 49
 
0.5%
금성면 41
 
0.4%
동면 40
 
0.4%
남구 33
 
0.3%
동남구 33
 
0.3%
서북구 32
 
0.3%
처인구 31
 
0.3%
Other values (3126) 9534
95.3%
2023-12-12T15:55:13.697682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6241
 
20.8%
2230
 
7.4%
1314
 
4.4%
910
 
3.0%
630
 
2.1%
606
 
2.0%
537
 
1.8%
454
 
1.5%
423
 
1.4%
391
 
1.3%
Other values (341) 16328
54.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29850
99.3%
Decimal Number 210
 
0.7%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6241
 
20.9%
2230
 
7.5%
1314
 
4.4%
910
 
3.0%
630
 
2.1%
606
 
2.0%
537
 
1.8%
454
 
1.5%
423
 
1.4%
391
 
1.3%
Other values (331) 16114
54.0%
Decimal Number
ValueCountFrequency (%)
1 69
32.9%
2 61
29.0%
3 38
18.1%
5 16
 
7.6%
4 15
 
7.1%
6 6
 
2.9%
7 4
 
1.9%
8 1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29846
99.3%
Common 214
 
0.7%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6241
 
20.9%
2230
 
7.5%
1314
 
4.4%
910
 
3.0%
630
 
2.1%
606
 
2.0%
537
 
1.8%
454
 
1.5%
423
 
1.4%
391
 
1.3%
Other values (328) 16110
54.0%
Common
ValueCountFrequency (%)
1 69
32.2%
2 61
28.5%
3 38
17.8%
5 16
 
7.5%
4 15
 
7.0%
6 6
 
2.8%
7 4
 
1.9%
) 2
 
0.9%
( 2
 
0.9%
8 1
 
0.5%
Han
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29846
99.3%
ASCII 214
 
0.7%
CJK 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6241
 
20.9%
2230
 
7.5%
1314
 
4.4%
910
 
3.0%
630
 
2.1%
606
 
2.0%
537
 
1.8%
454
 
1.5%
423
 
1.4%
391
 
1.3%
Other values (328) 16110
54.0%
ASCII
ValueCountFrequency (%)
1 69
32.2%
2 61
28.5%
3 38
17.8%
5 16
 
7.5%
4 15
 
7.0%
6 6
 
2.8%
7 4
 
1.9%
) 2
 
0.9%
( 2
 
0.9%
8 1
 
0.5%
CJK
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

존폐여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
9256 
False
 
744
ValueCountFrequency (%)
True 9256
92.6%
False 744
 
7.4%
2023-12-12T15:55:13.819542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct9994
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T15:55:14.155520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length15.1336
Min length10

Characters and Unicode

Total characters151336
Distinct characters386
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9988 ?
Unique (%)99.9%

Sample

1st row경기도 안산시 단원구 고잔동
2nd row전라남도 고흥군 금산면 어전리
3rd row경상남도 합천군 쌍백면 육리
4th row경기도 가평군 설악면
5th row경상남도 마산시 합포구 대창동
ValueCountFrequency (%)
경상북도 1543
 
4.0%
전라남도 1400
 
3.7%
경기도 1390
 
3.6%
충청남도 1179
 
3.1%
경상남도 1135
 
3.0%
전라북도 857
 
2.2%
충청북도 832
 
2.2%
강원도 657
 
1.7%
서울특별시 205
 
0.5%
천안시 178
 
0.5%
Other values (7231) 28754
75.4%
2023-12-12T15:55:14.659076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28130
 
18.6%
9800
 
6.5%
7703
 
5.1%
6787
 
4.5%
5370
 
3.5%
5136
 
3.4%
4888
 
3.2%
4412
 
2.9%
3893
 
2.6%
3208
 
2.1%
Other values (376) 72009
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122985
81.3%
Space Separator 28130
 
18.6%
Decimal Number 213
 
0.1%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9800
 
8.0%
7703
 
6.3%
6787
 
5.5%
5370
 
4.4%
5136
 
4.2%
4888
 
4.0%
4412
 
3.6%
3893
 
3.2%
3208
 
2.6%
2931
 
2.4%
Other values (365) 68857
56.0%
Decimal Number
ValueCountFrequency (%)
1 70
32.9%
2 62
29.1%
3 39
18.3%
5 16
 
7.5%
4 15
 
7.0%
6 6
 
2.8%
7 4
 
1.9%
8 1
 
0.5%
Space Separator
ValueCountFrequency (%)
28130
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122977
81.3%
Common 28351
 
18.7%
Han 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9800
 
8.0%
7703
 
6.3%
6787
 
5.5%
5370
 
4.4%
5136
 
4.2%
4888
 
4.0%
4412
 
3.6%
3893
 
3.2%
3208
 
2.6%
2931
 
2.4%
Other values (360) 68849
56.0%
Common
ValueCountFrequency (%)
28130
99.2%
1 70
 
0.2%
2 62
 
0.2%
3 39
 
0.1%
5 16
 
0.1%
4 15
 
0.1%
6 6
 
< 0.1%
( 4
 
< 0.1%
7 4
 
< 0.1%
) 4
 
< 0.1%
Han
ValueCountFrequency (%)
3
37.5%
2
25.0%
1
 
12.5%
1
 
12.5%
1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 122977
81.3%
ASCII 28351
 
18.7%
CJK 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28130
99.2%
1 70
 
0.2%
2 62
 
0.2%
3 39
 
0.1%
5 16
 
0.1%
4 15
 
0.1%
6 6
 
< 0.1%
( 4
 
< 0.1%
7 4
 
< 0.1%
) 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
9800
 
8.0%
7703
 
6.3%
6787
 
5.5%
5370
 
4.4%
5136
 
4.2%
4888
 
4.0%
4412
 
3.6%
3893
 
3.2%
3208
 
2.6%
2931
 
2.4%
Other values (360) 68849
56.0%
CJK
ValueCountFrequency (%)
3
37.5%
2
25.0%
1
 
12.5%
1
 
12.5%
1
 
12.5%


Text

MISSING 

Distinct4182
Distinct (%)59.4%
Missing2961
Missing (%)29.6%
Memory size156.2 KiB
2023-12-12T15:55:15.033508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length2.9893451
Min length2

Characters and Unicode

Total characters21042
Distinct characters347
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2978 ?
Unique (%)42.3%

Sample

1st row어전리
2nd row육리
3rd row반계리
4th row이송천리
5th row추전리
ValueCountFrequency (%)
용산리 21
 
0.3%
대곡리 18
 
0.3%
신흥리 18
 
0.3%
오산리 17
 
0.2%
신촌리 16
 
0.2%
덕산리 16
 
0.2%
송정리 16
 
0.2%
신기리 15
 
0.2%
금곡리 15
 
0.2%
신평리 14
 
0.2%
Other values (4172) 6873
97.6%
2023-12-12T15:55:15.507339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7061
33.6%
562
 
2.7%
421
 
2.0%
358
 
1.7%
302
 
1.4%
297
 
1.4%
276
 
1.3%
276
 
1.3%
237
 
1.1%
231
 
1.1%
Other values (337) 11021
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21035
> 99.9%
Decimal Number 3
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7061
33.6%
562
 
2.7%
421
 
2.0%
358
 
1.7%
302
 
1.4%
297
 
1.4%
276
 
1.3%
276
 
1.3%
237
 
1.1%
231
 
1.1%
Other values (332) 11014
52.4%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
3 1
33.3%
2 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21031
99.9%
Common 7
 
< 0.1%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7061
33.6%
562
 
2.7%
421
 
2.0%
358
 
1.7%
302
 
1.4%
297
 
1.4%
276
 
1.3%
276
 
1.3%
237
 
1.1%
231
 
1.1%
Other values (328) 11010
52.4%
Common
ValueCountFrequency (%)
) 2
28.6%
( 2
28.6%
1 1
14.3%
3 1
14.3%
2 1
14.3%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21031
99.9%
ASCII 7
 
< 0.1%
CJK 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7061
33.6%
562
 
2.7%
421
 
2.0%
358
 
1.7%
302
 
1.4%
297
 
1.4%
276
 
1.3%
276
 
1.3%
237
 
1.1%
231
 
1.1%
Other values (328) 11010
52.4%
ASCII
ValueCountFrequency (%)
) 2
28.6%
( 2
28.6%
1 1
14.3%
3 1
14.3%
2 1
14.3%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Interactions

2023-12-12T15:55:12.251012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:55:15.600411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동코드존폐여부
법정동코드1.0000.268
존폐여부0.2681.000
2023-12-12T15:55:15.675996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동코드존폐여부
법정동코드1.0000.286
존폐여부0.2861.000

Missing values

2023-12-12T15:55:12.383102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:55:12.498282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

법정동코드법정동명존폐여부법정동 전체명
7904127310100고잔동Y경기도 안산시 단원구 고잔동<NA>
97834677033022금산면Y전라남도 고흥군 금산면 어전리어전리
173424889042022쌍백면Y경상남도 합천군 쌍백면 육리육리
21824182031000설악면Y경기도 가평군 설악면<NA>
198324815110900대창동N경상남도 마산시 합포구 대창동<NA>
167864725038027모동면Y경상북도 상주시 모동면 반계리반계리
65244159010200신남동Y경기도 화성시 신남동<NA>
181224717033024서후면Y경상북도 안동시 서후면 이송천리이송천리
70874211038030북산면Y강원도 춘천시 북산면 추전리추전리
12303023010800법동Y대전광역시 대덕구 법동<NA>
법정동코드법정동명존폐여부법정동 전체명
2334219010600동점동Y강원도 태백시 동점동<NA>
53062611010800중앙동5가Y부산광역시 중구 중앙동5가<NA>
17124146131027처인구Y경기도 용인시 처인구 모현면 능원리능원리
112504421034028지곡면Y충청남도 서산시 지곡면 환성리환성리
101834572039023정천면Y전라북도 진안군 정천면 모정리모정리
135464482534023소원면Y충청남도 태안군 소원면 신덕리신덕리
45114155025028공도읍Y경기도 안성시 공도읍 불당리불당리
172144785037029기산면Y경상북도 칠곡군 기산면 평복리평복리
208544711310600중앙동Y경상북도 포항시 북구 중앙동<NA>
191724713031026양북면Y경상북도 경주시 양북면 구길리구길리