Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시상수도사업본부_원격검침단말기위치정보_20220722
Author부산광역시 상수도사업본부
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15102585

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:37:08.480862
Analysis finished2023-12-10 16:37:10.316262
Duration1.84 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32206.175
Minimum9
Maximum64574
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:37:10.427050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile3044.6
Q115938.5
median32131
Q348496
95-th percentile61443.25
Maximum64574
Range64565
Interquartile range (IQR)32557.5

Descriptive statistics

Standard deviation18737.674
Coefficient of variation (CV)0.58180377
Kurtosis-1.2078299
Mean32206.175
Median Absolute Deviation (MAD)16239.5
Skewness0.005574854
Sum3.2206175 × 108
Variance3.5110043 × 108
MonotonicityNot monotonic
2023-12-11T01:37:10.615317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37003 1
 
< 0.1%
20305 1
 
< 0.1%
23413 1
 
< 0.1%
10609 1
 
< 0.1%
20269 1
 
< 0.1%
49062 1
 
< 0.1%
46145 1
 
< 0.1%
26184 1
 
< 0.1%
29646 1
 
< 0.1%
20893 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
9 1
< 0.1%
12 1
< 0.1%
22 1
< 0.1%
24 1
< 0.1%
31 1
< 0.1%
38 1
< 0.1%
44 1
< 0.1%
45 1
< 0.1%
51 1
< 0.1%
55 1
< 0.1%
ValueCountFrequency (%)
64574 1
< 0.1%
64565 1
< 0.1%
64553 1
< 0.1%
64546 1
< 0.1%
64542 1
< 0.1%
64541 1
< 0.1%
64535 1
< 0.1%
64522 1
< 0.1%
64514 1
< 0.1%
64508 1
< 0.1%
Distinct9340
Distinct (%)93.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:37:11.069992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length3
Mean length5.3595
Min length2

Characters and Unicode

Total characters53595
Distinct characters721
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8846 ?
Unique (%)88.5%

Sample

1st row(주)SY상사
2nd row윤금심
3rd row황성택(농막)
4th row티엔(대인정보통신)
5th row김재근
ValueCountFrequency (%)
301호 39
 
0.4%
201호 27
 
0.3%
302호 23
 
0.2%
501호 21
 
0.2%
402호 19
 
0.2%
401호 18
 
0.2%
202호 18
 
0.2%
101호 15
 
0.1%
a동 13
 
0.1%
203호 13
 
0.1%
Other values (9473) 10394
98.1%
2023-12-11T01:37:11.664251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 2973
 
5.5%
) 2971
 
5.5%
1922
 
3.6%
1481
 
2.8%
1255
 
2.3%
1134
 
2.1%
1066
 
2.0%
0 1005
 
1.9%
807
 
1.5%
1 796
 
1.5%
Other values (711) 38185
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42932
80.1%
Decimal Number 3439
 
6.4%
Open Punctuation 2983
 
5.6%
Close Punctuation 2981
 
5.6%
Space Separator 611
 
1.1%
Uppercase Letter 501
 
0.9%
Other Punctuation 71
 
0.1%
Dash Punctuation 46
 
0.1%
Lowercase Letter 26
 
< 0.1%
Connector Punctuation 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1922
 
4.5%
1481
 
3.4%
1255
 
2.9%
1134
 
2.6%
1066
 
2.5%
807
 
1.9%
748
 
1.7%
702
 
1.6%
625
 
1.5%
579
 
1.3%
Other values (647) 32613
76.0%
Uppercase Letter
ValueCountFrequency (%)
A 88
17.6%
B 73
14.6%
T 38
 
7.6%
E 34
 
6.8%
C 31
 
6.2%
S 29
 
5.8%
P 26
 
5.2%
N 25
 
5.0%
H 23
 
4.6%
K 20
 
4.0%
Other values (14) 114
22.8%
Lowercase Letter
ValueCountFrequency (%)
n 6
23.1%
c 4
15.4%
k 3
11.5%
h 2
 
7.7%
g 2
 
7.7%
a 2
 
7.7%
s 1
 
3.8%
o 1
 
3.8%
z 1
 
3.8%
y 1
 
3.8%
Other values (3) 3
11.5%
Decimal Number
ValueCountFrequency (%)
0 1005
29.2%
1 796
23.1%
2 696
20.2%
3 369
 
10.7%
4 203
 
5.9%
5 188
 
5.5%
6 66
 
1.9%
7 51
 
1.5%
8 38
 
1.1%
9 27
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 35
49.3%
. 20
28.2%
& 12
 
16.9%
: 2
 
2.8%
/ 2
 
2.8%
Open Punctuation
ValueCountFrequency (%)
( 2973
99.7%
[ 9
 
0.3%
{ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2971
99.7%
] 9
 
0.3%
} 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
611
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 46
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42932
80.1%
Common 10135
 
18.9%
Latin 528
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1922
 
4.5%
1481
 
3.4%
1255
 
2.9%
1134
 
2.6%
1066
 
2.5%
807
 
1.9%
748
 
1.7%
702
 
1.6%
625
 
1.5%
579
 
1.3%
Other values (647) 32613
76.0%
Latin
ValueCountFrequency (%)
A 88
16.7%
B 73
13.8%
T 38
 
7.2%
E 34
 
6.4%
C 31
 
5.9%
S 29
 
5.5%
P 26
 
4.9%
N 25
 
4.7%
H 23
 
4.4%
K 20
 
3.8%
Other values (28) 141
26.7%
Common
ValueCountFrequency (%)
( 2973
29.3%
) 2971
29.3%
0 1005
 
9.9%
1 796
 
7.9%
2 696
 
6.9%
611
 
6.0%
3 369
 
3.6%
4 203
 
2.0%
5 188
 
1.9%
6 66
 
0.7%
Other values (16) 257
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42932
80.1%
ASCII 10661
 
19.9%
Number Forms 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 2973
27.9%
) 2971
27.9%
0 1005
 
9.4%
1 796
 
7.5%
2 696
 
6.5%
611
 
5.7%
3 369
 
3.5%
4 203
 
1.9%
5 188
 
1.8%
A 88
 
0.8%
Other values (52) 761
 
7.1%
Hangul
ValueCountFrequency (%)
1922
 
4.5%
1481
 
3.4%
1255
 
2.9%
1134
 
2.6%
1066
 
2.5%
807
 
1.9%
748
 
1.7%
702
 
1.6%
625
 
1.5%
579
 
1.3%
Other values (647) 32613
76.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

구역
Text

Distinct228
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:37:12.101435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length29
Mean length23.291
Min length18

Characters and Unicode

Total characters232910
Distinct characters128
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row부산광역시 강서 사업소 구역 미지정 수용가
2nd row부산광역시 기장 사업소 철마면
3rd row부산광역시 기장 사업소 장안읍
4th row부산광역시 강서 사업소 대저2동
5th row부산광역시 중동부 사업소 초량3동
ValueCountFrequency (%)
부산광역시 10000
25.5%
사업소 8790
22.4%
강서 3558
 
9.1%
기장 1621
 
4.1%
동래통합사업소 1159
 
3.0%
녹산동 848
 
2.2%
남부 692
 
1.8%
대저1동 671
 
1.7%
대저2동 630
 
1.6%
부산진 628
 
1.6%
Other values (225) 10619
27.1%
2023-12-11T01:37:12.779001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92624
39.8%
12932
 
5.6%
11742
 
5.0%
10538
 
4.5%
10371
 
4.5%
10271
 
4.4%
10213
 
4.4%
10067
 
4.3%
10023
 
4.3%
9949
 
4.3%
Other values (118) 44180
19.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 135352
58.1%
Space Separator 92624
39.8%
Decimal Number 4934
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12932
 
9.6%
11742
 
8.7%
10538
 
7.8%
10371
 
7.7%
10271
 
7.6%
10213
 
7.5%
10067
 
7.4%
10023
 
7.4%
9949
 
7.4%
3980
 
2.9%
Other values (109) 35266
26.1%
Decimal Number
ValueCountFrequency (%)
1 2029
41.1%
2 1853
37.6%
3 571
 
11.6%
4 233
 
4.7%
5 120
 
2.4%
6 96
 
1.9%
9 26
 
0.5%
8 6
 
0.1%
Space Separator
ValueCountFrequency (%)
92624
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 135352
58.1%
Common 97558
41.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12932
 
9.6%
11742
 
8.7%
10538
 
7.8%
10371
 
7.7%
10271
 
7.6%
10213
 
7.5%
10067
 
7.4%
10023
 
7.4%
9949
 
7.4%
3980
 
2.9%
Other values (109) 35266
26.1%
Common
ValueCountFrequency (%)
92624
94.9%
1 2029
 
2.1%
2 1853
 
1.9%
3 571
 
0.6%
4 233
 
0.2%
5 120
 
0.1%
6 96
 
0.1%
9 26
 
< 0.1%
8 6
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 135352
58.1%
ASCII 97558
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92624
94.9%
1 2029
 
2.1%
2 1853
 
1.9%
3 571
 
0.6%
4 233
 
0.2%
5 120
 
0.1%
6 96
 
0.1%
9 26
 
< 0.1%
8 6
 
< 0.1%
Hangul
ValueCountFrequency (%)
12932
 
9.6%
11742
 
8.7%
10538
 
7.8%
10371
 
7.7%
10271
 
7.6%
10213
 
7.5%
10067
 
7.4%
10023
 
7.4%
9949
 
7.4%
3980
 
2.9%
Other values (109) 35266
26.1%

주소
Text

Distinct9589
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:37:13.242649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length79
Mean length30.0712
Min length6

Characters and Unicode

Total characters300712
Distinct characters667
Distinct categories15 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9370 ?
Unique (%)93.7%

Sample

1st row해척동서길 79 (명지동) /명지동 748-7/폐전/임뿌리/211115o014270/88571DFFFE37919F
2nd row철마면 연구리 116-6
3rd row장안읍 좌동리 32 /부산광역시 기장군 장안읍 좌동리 32
4th row유통단지1로97번길 11 / (119호)(대저2동) /대저2동 3146-1 (119동)
5th row부산광역시 초량3동 122-51 /홍곡로 20 (초량동)
ValueCountFrequency (%)
부산광역시 1996
 
3.8%
대저1동 1188
 
2.3%
대저2동 1187
 
2.3%
강동동 720
 
1.4%
명지동 708
 
1.4%
장안읍 661
 
1.3%
정관읍 524
 
1.0%
녹산동 421
 
0.8%
철마면 419
 
0.8%
기장읍 406
 
0.8%
Other values (15013) 43625
84.1%
2023-12-11T01:37:13.928123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43592
 
14.5%
1 19500
 
6.5%
18165
 
6.0%
2 14047
 
4.7%
- 10608
 
3.5%
3 10215
 
3.4%
/ 10053
 
3.3%
) 8990
 
3.0%
( 8974
 
3.0%
4 7947
 
2.6%
Other values (657) 148621
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 128606
42.8%
Decimal Number 88251
29.3%
Space Separator 43592
 
14.5%
Dash Punctuation 10608
 
3.5%
Other Punctuation 10146
 
3.4%
Close Punctuation 9026
 
3.0%
Open Punctuation 9010
 
3.0%
Uppercase Letter 966
 
0.3%
Lowercase Letter 478
 
0.2%
Math Symbol 17
 
< 0.1%
Other values (5) 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18165
 
14.1%
7719
 
6.0%
7179
 
5.6%
5511
 
4.3%
4665
 
3.6%
4225
 
3.3%
2833
 
2.2%
2753
 
2.1%
2733
 
2.1%
2275
 
1.8%
Other values (580) 70548
54.9%
Uppercase Letter
ValueCountFrequency (%)
F 164
17.0%
B 148
15.3%
A 137
14.2%
D 125
12.9%
E 84
8.7%
C 62
 
6.4%
L 45
 
4.7%
P 38
 
3.9%
T 22
 
2.3%
S 19
 
2.0%
Other values (14) 122
12.6%
Lowercase Letter
ValueCountFrequency (%)
f 169
35.4%
o 82
17.2%
e 74
15.5%
d 70
14.6%
a 27
 
5.6%
c 18
 
3.8%
b 15
 
3.1%
n 3
 
0.6%
y 3
 
0.6%
i 3
 
0.6%
Other values (7) 14
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 19500
22.1%
2 14047
15.9%
3 10215
11.6%
4 7947
9.0%
5 7325
 
8.3%
6 6646
 
7.5%
7 6135
 
7.0%
0 5616
 
6.4%
8 5510
 
6.2%
9 5310
 
6.0%
Other Punctuation
ValueCountFrequency (%)
/ 10053
99.1%
: 36
 
0.4%
. 29
 
0.3%
; 16
 
0.2%
@ 4
 
< 0.1%
, 3
 
< 0.1%
& 3
 
< 0.1%
# 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 14
82.4%
> 1
 
5.9%
+ 1
 
5.9%
~ 1
 
5.9%
Close Punctuation
ValueCountFrequency (%)
) 8990
99.6%
] 34
 
0.4%
} 2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 8974
99.6%
[ 34
 
0.4%
{ 2
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
43592
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10608
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 170660
56.8%
Hangul 128606
42.8%
Latin 1446
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18165
 
14.1%
7719
 
6.0%
7179
 
5.6%
5511
 
4.3%
4665
 
3.6%
4225
 
3.3%
2833
 
2.2%
2753
 
2.1%
2733
 
2.1%
2275
 
1.8%
Other values (580) 70548
54.9%
Latin
ValueCountFrequency (%)
f 169
11.7%
F 164
11.3%
B 148
10.2%
A 137
9.5%
D 125
 
8.6%
E 84
 
5.8%
o 82
 
5.7%
e 74
 
5.1%
d 70
 
4.8%
C 62
 
4.3%
Other values (32) 331
22.9%
Common
ValueCountFrequency (%)
43592
25.5%
1 19500
11.4%
2 14047
 
8.2%
- 10608
 
6.2%
3 10215
 
6.0%
/ 10053
 
5.9%
) 8990
 
5.3%
( 8974
 
5.3%
4 7947
 
4.7%
5 7325
 
4.3%
Other values (25) 29409
17.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 172100
57.2%
Hangul 128602
42.8%
Compat Jamo 4
 
< 0.1%
Misc Symbols 3
 
< 0.1%
Number Forms 2
 
< 0.1%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43592
25.3%
1 19500
11.3%
2 14047
 
8.2%
- 10608
 
6.2%
3 10215
 
5.9%
/ 10053
 
5.8%
) 8990
 
5.2%
( 8974
 
5.2%
4 7947
 
4.6%
5 7325
 
4.3%
Other values (64) 30849
17.9%
Hangul
ValueCountFrequency (%)
18165
 
14.1%
7719
 
6.0%
7179
 
5.6%
5511
 
4.3%
4665
 
3.6%
4225
 
3.3%
2833
 
2.2%
2753
 
2.1%
2733
 
2.1%
2275
 
1.8%
Other values (579) 70544
54.9%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Misc Symbols
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-11T01:37:09.951905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T01:37:10.114110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:37:10.250175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호수용가명구역주소
3700237003(주)SY상사부산광역시 강서 사업소 구역 미지정 수용가해척동서길 79 (명지동) /명지동 748-7/폐전/임뿌리/211115o014270/88571DFFFE37919F
6333263333윤금심부산광역시 기장 사업소 철마면철마면 연구리 116-6
5963659637황성택(농막)부산광역시 기장 사업소 장안읍장안읍 좌동리 32 /부산광역시 기장군 장안읍 좌동리 32
4752447525티엔(대인정보통신)부산광역시 강서 사업소 대저2동유통단지1로97번길 11 / (119호)(대저2동) /대저2동 3146-1 (119동)
27022703김재근부산광역시 중동부 사업소 초량3동부산광역시 초량3동 122-51 /홍곡로 20 (초량동)
1074010741김인자부산광역시 동래통합사업소 안락1동부산광역시 안락1동 995-7 /명륜로112번길 175-6 (안락동)
3020530206김영민부산광역시 북부 사업소 덕포2동덕포2동 22-2 /덕상로72번길 37-5 (덕포동)
1729217293김용기부산광역시 사하 사업소 다대1동다대동 831/다대동로 35-1 (다대동)
5397553976이건숙부산광역시 강서 사업소 녹산동녹산화전로117번길 115-19 (화전동) /화전동 379
55645565남동호(주)DHN개발부산광역시 부산진 사업소 부전1동부전1동 266-31 1층밀면 /새싹로8번길 32 (부전동/ 1층밀면)
번호수용가명구역주소
20642065김위수부산광역시 서부 사업소 남부민1동남부민1동 23-183 /천마로199번길 35-11 (남부민동)
6037760378현대건설부산광역시 강서 사업소 강동동강서구 강동동 4999-3
1108711088김대열부산광역시 동래통합사업소 안락2동부산광역시 안락2동 144-34 /충렬대로446번나길 16 (안락동)
4787247873(주)두성공업사부산광역시 강서 사업소 명지동낙동남로1013번길 50 (명지동) /명지동 3178-9
2909629097이길우(자매국밥)부산광역시 남부 사업소 민락동민락동 32-10 /민락본동로27번길 56 (민락동)
2702927030이지선부산광역시 동래통합사업소 연산9동부산광역시 연산9동 117-3 /과정로114번길 30-12 (연산동)
3754837549김평옥부산광역시 사하 사업소 신평2동하신번영로161번길 14 (신평동)/신평2동 615-14
4693346934신순희부산광역시 강서 사업소 명지동명지오션시티8로6번길 8 (명지동) /명지동 3248-11
169170김승일부산광역시 중동부 사업소 대청동대청동4가 26-38 /중구로97번길 14-1 (대청동4가)
2181621817김명심(유대규)부산광역시 강서 사업소 대저1동부산광역시 낙동북로212번길 34 (대저1동) /대저1동 1703-23