Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시상수도사업본부_원격검침단말기위치정보_20230117
Author부산광역시 상수도사업본부
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15102585

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:37:00.182747
Analysis finished2023-12-10 16:37:01.862728
Duration1.68 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34852.688
Minimum11
Maximum68765
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:37:01.982953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile3597.35
Q118154.75
median35009.5
Q351939
95-th percentile65494.4
Maximum68765
Range68754
Interquartile range (IQR)33784.25

Descriptive statistics

Standard deviation19721.29
Coefficient of variation (CV)0.56584702
Kurtosis-1.1820407
Mean34852.688
Median Absolute Deviation (MAD)16899.5
Skewness-0.027399904
Sum3.4852688 × 108
Variance3.8892926 × 108
MonotonicityNot monotonic
2023-12-11T01:37:02.162775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
44042 1
 
< 0.1%
48394 1
 
< 0.1%
23849 1
 
< 0.1%
32459 1
 
< 0.1%
31076 1
 
< 0.1%
47209 1
 
< 0.1%
36648 1
 
< 0.1%
21483 1
 
< 0.1%
29822 1
 
< 0.1%
33779 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
11 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
24 1
< 0.1%
31 1
< 0.1%
37 1
< 0.1%
41 1
< 0.1%
45 1
< 0.1%
48 1
< 0.1%
51 1
< 0.1%
ValueCountFrequency (%)
68765 1
< 0.1%
68754 1
< 0.1%
68726 1
< 0.1%
68721 1
< 0.1%
68706 1
< 0.1%
68700 1
< 0.1%
68696 1
< 0.1%
68689 1
< 0.1%
68678 1
< 0.1%
68677 1
< 0.1%
Distinct6535
Distinct (%)65.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:37:02.574569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length3
Mean length5.4463
Min length2

Characters and Unicode

Total characters54463
Distinct characters719
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5549 ?
Unique (%)55.5%

Sample

1st row김*운
2nd row신*보리밥(이순학)
3rd row방*년
4th row조*수(수정제일교회)
5th row토*스바버
ValueCountFrequency (%)
김*자 58
 
0.5%
김*수 53
 
0.5%
김*순 42
 
0.4%
김*숙 35
 
0.3%
김*호 34
 
0.3%
이*자 33
 
0.3%
김*진 33
 
0.3%
김*희 31
 
0.3%
이*희 31
 
0.3%
이*숙 30
 
0.3%
Other values (6706) 10217
96.4%
2023-12-11T01:37:03.202399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 10000
 
18.4%
( 3161
 
5.8%
) 3152
 
5.8%
1830
 
3.4%
1499
 
2.8%
1172
 
2.2%
775
 
1.4%
1 760
 
1.4%
739
 
1.4%
0 712
 
1.3%
Other values (709) 30663
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33963
62.4%
Other Punctuation 10065
 
18.5%
Open Punctuation 3170
 
5.8%
Close Punctuation 3161
 
5.8%
Decimal Number 3029
 
5.6%
Space Separator 606
 
1.1%
Uppercase Letter 405
 
0.7%
Dash Punctuation 42
 
0.1%
Lowercase Letter 20
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1830
 
5.4%
1499
 
4.4%
1172
 
3.5%
775
 
2.3%
739
 
2.2%
565
 
1.7%
468
 
1.4%
443
 
1.3%
418
 
1.2%
417
 
1.2%
Other values (649) 25637
75.5%
Uppercase Letter
ValueCountFrequency (%)
A 74
18.3%
B 60
14.8%
T 30
 
7.4%
S 30
 
7.4%
C 27
 
6.7%
G 20
 
4.9%
P 20
 
4.9%
E 19
 
4.7%
D 18
 
4.4%
H 16
 
4.0%
Other values (14) 91
22.5%
Lowercase Letter
ValueCountFrequency (%)
e 6
30.0%
o 4
20.0%
k 2
 
10.0%
r 1
 
5.0%
w 1
 
5.0%
s 1
 
5.0%
u 1
 
5.0%
d 1
 
5.0%
h 1
 
5.0%
a 1
 
5.0%
Decimal Number
ValueCountFrequency (%)
1 760
25.1%
0 712
23.5%
2 694
22.9%
3 350
11.6%
4 196
 
6.5%
5 141
 
4.7%
6 62
 
2.0%
7 49
 
1.6%
9 33
 
1.1%
8 32
 
1.1%
Other Punctuation
ValueCountFrequency (%)
* 10000
99.4%
, 36
 
0.4%
. 13
 
0.1%
/ 8
 
0.1%
& 4
 
< 0.1%
: 3
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 3161
99.7%
[ 9
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 3152
99.7%
] 9
 
0.3%
Space Separator
ValueCountFrequency (%)
606
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33963
62.4%
Common 20074
36.9%
Latin 426
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1830
 
5.4%
1499
 
4.4%
1172
 
3.5%
775
 
2.3%
739
 
2.2%
565
 
1.7%
468
 
1.4%
443
 
1.3%
418
 
1.2%
417
 
1.2%
Other values (649) 25637
75.5%
Latin
ValueCountFrequency (%)
A 74
17.4%
B 60
14.1%
T 30
 
7.0%
S 30
 
7.0%
C 27
 
6.3%
G 20
 
4.7%
P 20
 
4.7%
E 19
 
4.5%
D 18
 
4.2%
H 16
 
3.8%
Other values (26) 112
26.3%
Common
ValueCountFrequency (%)
* 10000
49.8%
( 3161
 
15.7%
) 3152
 
15.7%
1 760
 
3.8%
0 712
 
3.5%
2 694
 
3.5%
606
 
3.0%
3 350
 
1.7%
4 196
 
1.0%
5 141
 
0.7%
Other values (14) 302
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33962
62.4%
ASCII 20498
37.6%
Number Forms 1
 
< 0.1%
None 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 10000
48.8%
( 3161
 
15.4%
) 3152
 
15.4%
1 760
 
3.7%
0 712
 
3.5%
2 694
 
3.4%
606
 
3.0%
3 350
 
1.7%
4 196
 
1.0%
5 141
 
0.7%
Other values (48) 726
 
3.5%
Hangul
ValueCountFrequency (%)
1830
 
5.4%
1499
 
4.4%
1172
 
3.5%
775
 
2.3%
739
 
2.2%
565
 
1.7%
468
 
1.4%
443
 
1.3%
418
 
1.2%
417
 
1.2%
Other values (648) 25636
75.5%
Number Forms
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

구역
Text

Distinct229
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:37:03.661071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length23
Mean length23.3956
Min length18

Characters and Unicode

Total characters233956
Distinct characters128
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row부산광역시 강서 사업소 가덕도동
2nd row부산광역시 부산진 사업소 범천4동
3rd row부산광역시 기장 사업소 철마면
4th row부산광역시 북부 사업소 덕천2동
5th row부산광역시 부산진 사업소 부전2동
ValueCountFrequency (%)
부산광역시 10000
25.3%
사업소 8806
22.3%
강서 3497
 
8.8%
기장 1964
 
5.0%
동래통합사업소 1156
 
2.9%
녹산동 844
 
2.1%
남부 677
 
1.7%
대저1동 611
 
1.5%
대저2동 572
 
1.4%
북부 555
 
1.4%
Other values (226) 10892
27.5%
2023-12-11T01:37:04.377398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93332
39.9%
12724
 
5.4%
11675
 
5.0%
10493
 
4.5%
10386
 
4.4%
10384
 
4.4%
10055
 
4.3%
10014
 
4.3%
9962
 
4.3%
9909
 
4.2%
Other values (118) 45022
19.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 136062
58.2%
Space Separator 93332
39.9%
Decimal Number 4562
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12724
 
9.4%
11675
 
8.6%
10493
 
7.7%
10386
 
7.6%
10384
 
7.6%
10055
 
7.4%
10014
 
7.4%
9962
 
7.3%
9909
 
7.3%
3897
 
2.9%
Other values (109) 36563
26.9%
Decimal Number
ValueCountFrequency (%)
1 1846
40.5%
2 1691
37.1%
3 568
 
12.5%
4 229
 
5.0%
5 106
 
2.3%
6 80
 
1.8%
9 33
 
0.7%
8 9
 
0.2%
Space Separator
ValueCountFrequency (%)
93332
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 136062
58.2%
Common 97894
41.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12724
 
9.4%
11675
 
8.6%
10493
 
7.7%
10386
 
7.6%
10384
 
7.6%
10055
 
7.4%
10014
 
7.4%
9962
 
7.3%
9909
 
7.3%
3897
 
2.9%
Other values (109) 36563
26.9%
Common
ValueCountFrequency (%)
93332
95.3%
1 1846
 
1.9%
2 1691
 
1.7%
3 568
 
0.6%
4 229
 
0.2%
5 106
 
0.1%
6 80
 
0.1%
9 33
 
< 0.1%
8 9
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 136062
58.2%
ASCII 97894
41.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
93332
95.3%
1 1846
 
1.9%
2 1691
 
1.7%
3 568
 
0.6%
4 229
 
0.2%
5 106
 
0.1%
6 80
 
0.1%
9 33
 
< 0.1%
8 9
 
< 0.1%
Hangul
ValueCountFrequency (%)
12724
 
9.4%
11675
 
8.6%
10493
 
7.7%
10386
 
7.6%
10384
 
7.6%
10055
 
7.4%
10014
 
7.4%
9962
 
7.3%
9909
 
7.3%
3897
 
2.9%
Other values (109) 36563
26.9%

주소
Text

Distinct9655
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:37:04.901351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length97
Median length78
Mean length29.3815
Min length6

Characters and Unicode

Total characters293815
Distinct characters685
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9434 ?
Unique (%)94.3%

Sample

1st row가덕해안로 819 (천성동) /천성동 491
2nd row범천4동 980-161 /신암로 100-8 (범천동)
3rd row철마면 임기리 980
4th row덕천2동 524-4 /금곡대로92번길 50 (덕천동)
5th row부전제2동 535-4 /중앙대로 663 (부전동)
ValueCountFrequency (%)
부산광역시 1757
 
3.5%
대저1동 1090
 
2.1%
대저2동 1050
 
2.1%
강동동 729
 
1.4%
명지동 725
 
1.4%
정관읍 599
 
1.2%
장안읍 578
 
1.1%
기장읍 545
 
1.1%
철마면 431
 
0.8%
송정동 407
 
0.8%
Other values (15004) 42968
84.5%
2023-12-11T01:37:05.686075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42861
 
14.6%
1 19110
 
6.5%
17376
 
5.9%
2 13447
 
4.6%
- 10466
 
3.6%
3 9969
 
3.4%
/ 9630
 
3.3%
) 8627
 
2.9%
( 8613
 
2.9%
4 7740
 
2.6%
Other values (675) 145976
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125432
42.7%
Decimal Number 86443
29.4%
Space Separator 42861
 
14.6%
Dash Punctuation 10466
 
3.6%
Other Punctuation 9718
 
3.3%
Close Punctuation 8670
 
3.0%
Open Punctuation 8655
 
2.9%
Uppercase Letter 1045
 
0.4%
Lowercase Letter 497
 
0.2%
Math Symbol 22
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17376
 
13.9%
7554
 
6.0%
7155
 
5.7%
5370
 
4.3%
4332
 
3.5%
4023
 
3.2%
2717
 
2.2%
2575
 
2.1%
2462
 
2.0%
2130
 
1.7%
Other values (600) 69738
55.6%
Uppercase Letter
ValueCountFrequency (%)
F 215
20.6%
B 171
16.4%
D 156
14.9%
A 149
14.3%
E 107
10.2%
C 44
 
4.2%
L 37
 
3.5%
P 28
 
2.7%
S 19
 
1.8%
T 16
 
1.5%
Other values (13) 103
9.9%
Lowercase Letter
ValueCountFrequency (%)
f 155
31.2%
o 92
18.5%
d 82
16.5%
e 77
15.5%
c 20
 
4.0%
a 19
 
3.8%
b 17
 
3.4%
l 6
 
1.2%
s 6
 
1.2%
i 4
 
0.8%
Other values (11) 19
 
3.8%
Decimal Number
ValueCountFrequency (%)
1 19110
22.1%
2 13447
15.6%
3 9969
11.5%
4 7740
9.0%
5 7232
 
8.4%
6 6518
 
7.5%
7 6268
 
7.3%
0 5628
 
6.5%
8 5397
 
6.2%
9 5134
 
5.9%
Other Punctuation
ValueCountFrequency (%)
/ 9630
99.1%
: 41
 
0.4%
; 19
 
0.2%
. 15
 
0.2%
, 8
 
0.1%
@ 3
 
< 0.1%
# 1
 
< 0.1%
& 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 17
77.3%
~ 3
 
13.6%
× 2
 
9.1%
Close Punctuation
ValueCountFrequency (%)
) 8627
99.5%
] 43
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 8613
99.5%
[ 42
 
0.5%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
42861
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10466
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 166841
56.8%
Hangul 125432
42.7%
Latin 1542
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17376
 
13.9%
7554
 
6.0%
7155
 
5.7%
5370
 
4.3%
4332
 
3.5%
4023
 
3.2%
2717
 
2.2%
2575
 
2.1%
2462
 
2.0%
2130
 
1.7%
Other values (600) 69738
55.6%
Latin
ValueCountFrequency (%)
F 215
13.9%
B 171
11.1%
D 156
10.1%
f 155
10.1%
A 149
9.7%
E 107
 
6.9%
o 92
 
6.0%
d 82
 
5.3%
e 77
 
5.0%
C 44
 
2.9%
Other values (34) 294
19.1%
Common
ValueCountFrequency (%)
42861
25.7%
1 19110
11.5%
2 13447
 
8.1%
- 10466
 
6.3%
3 9969
 
6.0%
/ 9630
 
5.8%
) 8627
 
5.2%
( 8613
 
5.2%
4 7740
 
4.6%
5 7232
 
4.3%
Other values (21) 29146
17.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 168378
57.3%
Hangul 125428
42.7%
Compat Jamo 4
 
< 0.1%
None 2
 
< 0.1%
Misc Symbols 2
 
< 0.1%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42861
25.5%
1 19110
11.3%
2 13447
 
8.0%
- 10466
 
6.2%
3 9969
 
5.9%
/ 9630
 
5.7%
) 8627
 
5.1%
( 8613
 
5.1%
4 7740
 
4.6%
5 7232
 
4.3%
Other values (62) 30683
18.2%
Hangul
ValueCountFrequency (%)
17376
 
13.9%
7554
 
6.0%
7155
 
5.7%
5370
 
4.3%
4332
 
3.5%
4023
 
3.2%
2717
 
2.2%
2575
 
2.1%
2462
 
2.0%
2130
 
1.7%
Other values (599) 69734
55.6%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
None
ValueCountFrequency (%)
× 2
100.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-11T01:37:01.463575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T01:37:01.657148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:37:01.785811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호수용가명구역주소
4404144042김*운부산광역시 강서 사업소 가덕도동가덕해안로 819 (천성동) /천성동 491
94649465신*보리밥(이순학)부산광역시 부산진 사업소 범천4동범천4동 980-161 /신암로 100-8 (범천동)
6108961090방*년부산광역시 기장 사업소 철마면철마면 임기리 980
1484814849조*수(수정제일교회)부산광역시 북부 사업소 덕천2동덕천2동 524-4 /금곡대로92번길 50 (덕천동)
60806081토*스바버부산광역시 부산진 사업소 부전2동부전제2동 535-4 /중앙대로 663 (부전동)
2684126842김*준부산광역시 동래통합사업소 연산3동부산광역시 금련로16번길 8-1 (연산동)
98399840장*열부산광역시 동래통합사업소 복산동칠산동 148-4 /명륜로112번길 146-1 (칠산동)
75407541신*진외 3명(김미연)부산광역시 부산진 사업소 전포1동부산광역시 전포1동 339-28 스몰굿커피 /전포대로200번길 19 (전포동/ 스몰굿커피)
2610126102유*준부산광역시 강서 사업소 녹산동낙동남로583번길 32 (녹산동) /녹산동 401
5090550906신*석부산광역시 기장 사업소 일광읍일광면 횡계1길 4
번호수용가명구역주소
1294312944박*용부산광역시 남부 사업소 용호1동동명로105번길 47 (용호동)
5670356704반*빈부산광역시 강서 사업소 가덕도동선창길 110-32 (성북동) /성북동 129-2
3510435105김*근부산광역시 강서 사업소 녹산동낙동남로582번가길 18-6 (녹산동) /녹산동 546-65
3527235273박*영(801호)부산광역시 동래통합사업소 청룡노포동부산광역시 청룡동 17-21 /청룡예전로 1-3 (청룡동/ 예뜨랑빌라)
3598135982이*은(재원금속)부산광역시 강서 사업소 녹산동녹산산업북로 277 (송정동) /송정동 1619-13
4050040501신*이부산광역시 강서 사업소 대저2동맥도길377번길 107 (대저2동)
4124441245박*화(농막)부산광역시 강서 사업소 대저2동맥도강변길 912-22 (대저2동) /대저2동 4467
5284552846사*구청 창조도시기획단장부산광역시 사하 사업소 신평2동신평2동 산107-15/비봉로54번안길 26-1 (신평동)
1259912600김*경부산광역시 남부 사업소 대연5동대연5동 1501-15 /못골번영로 69 (대연동)
3637136372A*403호박창현부산광역시 기장 사업소 기장읍기장읍 동부리 395-1 /기장읍 배산로68번길 5 (남경캐슬)