Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시 상수도사업본부의 원격검침 프로그램(Aqua Smart Metering)에 사용되는 원격검침 단말기 위치정보 자료입니다.
Author부산광역시 상수도사업본부
URLhttps://www.data.go.kr/data/15102585/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:21:32.379232
Analysis finished2023-12-12 04:21:33.839982
Duration1.46 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34738.495
Minimum4
Maximum68763
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:21:33.942668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile3638.75
Q117723.75
median34990.5
Q351759.75
95-th percentile65348.05
Maximum68763
Range68759
Interquartile range (IQR)34036

Descriptive statistics

Standard deviation19737.857
Coefficient of variation (CV)0.568184
Kurtosis-1.1835311
Mean34738.495
Median Absolute Deviation (MAD)16982
Skewness-0.024195895
Sum3.4738495 × 108
Variance3.8958301 × 108
MonotonicityNot monotonic
2023-12-12T13:21:34.116491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39452 1
 
< 0.1%
17964 1
 
< 0.1%
52458 1
 
< 0.1%
22147 1
 
< 0.1%
57098 1
 
< 0.1%
36865 1
 
< 0.1%
7948 1
 
< 0.1%
1227 1
 
< 0.1%
1540 1
 
< 0.1%
26852 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
4 1
< 0.1%
17 1
< 0.1%
24 1
< 0.1%
40 1
< 0.1%
45 1
< 0.1%
53 1
< 0.1%
54 1
< 0.1%
65 1
< 0.1%
66 1
< 0.1%
69 1
< 0.1%
ValueCountFrequency (%)
68763 1
< 0.1%
68754 1
< 0.1%
68753 1
< 0.1%
68750 1
< 0.1%
68743 1
< 0.1%
68741 1
< 0.1%
68739 1
< 0.1%
68735 1
< 0.1%
68730 1
< 0.1%
68728 1
< 0.1%
Distinct6520
Distinct (%)65.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T13:21:34.666896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length3
Mean length5.3824
Min length1

Characters and Unicode

Total characters53824
Distinct characters703
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5555 ?
Unique (%)55.5%

Sample

1st row장*순(신흥컨설팅)
2nd row(*)현대테크,박명순
3rd row3*3호 이성우
4th row주*진
5th row김*배
ValueCountFrequency (%)
김*자 57
 
0.5%
김*수 56
 
0.5%
김*순 51
 
0.5%
김*숙 50
 
0.5%
김*희 43
 
0.4%
이*희 38
 
0.4%
김*호 34
 
0.3%
김*식 34
 
0.3%
박*수 30
 
0.3%
이*자 30
 
0.3%
Other values (6736) 10238
96.0%
2023-12-12T13:21:35.432904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 10000
 
18.6%
( 2971
 
5.5%
) 2958
 
5.5%
1806
 
3.4%
1479
 
2.7%
1186
 
2.2%
1 771
 
1.4%
734
 
1.4%
0 714
 
1.3%
713
 
1.3%
Other values (693) 30492
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33542
62.3%
Other Punctuation 10085
 
18.7%
Decimal Number 3069
 
5.7%
Open Punctuation 2983
 
5.5%
Close Punctuation 2971
 
5.5%
Space Separator 676
 
1.3%
Uppercase Letter 429
 
0.8%
Dash Punctuation 45
 
0.1%
Lowercase Letter 22
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1806
 
5.4%
1479
 
4.4%
1186
 
3.5%
734
 
2.2%
713
 
2.1%
525
 
1.6%
478
 
1.4%
448
 
1.3%
423
 
1.3%
380
 
1.1%
Other values (634) 25370
75.6%
Uppercase Letter
ValueCountFrequency (%)
A 78
18.2%
B 66
15.4%
C 35
 
8.2%
T 32
 
7.5%
S 28
 
6.5%
P 22
 
5.1%
K 18
 
4.2%
E 18
 
4.2%
D 17
 
4.0%
N 14
 
3.3%
Other values (12) 101
23.5%
Lowercase Letter
ValueCountFrequency (%)
s 3
13.6%
k 3
13.6%
c 3
13.6%
e 3
13.6%
p 2
9.1%
d 2
9.1%
t 1
 
4.5%
o 1
 
4.5%
r 1
 
4.5%
x 1
 
4.5%
Other values (2) 2
9.1%
Decimal Number
ValueCountFrequency (%)
1 771
25.1%
0 714
23.3%
2 707
23.0%
3 353
11.5%
5 179
 
5.8%
4 161
 
5.2%
6 62
 
2.0%
7 51
 
1.7%
8 41
 
1.3%
9 30
 
1.0%
Other Punctuation
ValueCountFrequency (%)
* 10000
99.2%
, 45
 
0.4%
. 23
 
0.2%
/ 7
 
0.1%
& 6
 
0.1%
: 2
 
< 0.1%
2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2971
99.6%
[ 12
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 2958
99.6%
] 13
 
0.4%
Space Separator
ValueCountFrequency (%)
676
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33542
62.3%
Common 19830
36.8%
Latin 452
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1806
 
5.4%
1479
 
4.4%
1186
 
3.5%
734
 
2.2%
713
 
2.1%
525
 
1.6%
478
 
1.4%
448
 
1.3%
423
 
1.3%
380
 
1.1%
Other values (634) 25370
75.6%
Latin
ValueCountFrequency (%)
A 78
17.3%
B 66
14.6%
C 35
 
7.7%
T 32
 
7.1%
S 28
 
6.2%
P 22
 
4.9%
K 18
 
4.0%
E 18
 
4.0%
D 17
 
3.8%
N 14
 
3.1%
Other values (25) 124
27.4%
Common
ValueCountFrequency (%)
* 10000
50.4%
( 2971
 
15.0%
) 2958
 
14.9%
1 771
 
3.9%
0 714
 
3.6%
2 707
 
3.6%
676
 
3.4%
3 353
 
1.8%
5 179
 
0.9%
4 161
 
0.8%
Other values (14) 340
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33542
62.3%
ASCII 20279
37.7%
None 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 10000
49.3%
( 2971
 
14.7%
) 2958
 
14.6%
1 771
 
3.8%
0 714
 
3.5%
2 707
 
3.5%
676
 
3.3%
3 353
 
1.7%
5 179
 
0.9%
4 161
 
0.8%
Other values (47) 789
 
3.9%
Hangul
ValueCountFrequency (%)
1806
 
5.4%
1479
 
4.4%
1186
 
3.5%
734
 
2.2%
713
 
2.1%
525
 
1.6%
478
 
1.4%
448
 
1.3%
423
 
1.3%
380
 
1.1%
Other values (634) 25370
75.6%
None
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

구역
Text

Distinct227
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T13:21:35.867524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length23
Mean length23.4285
Min length18

Characters and Unicode

Total characters234285
Distinct characters128
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row부산광역시 강서 사업소 가락동
2nd row부산광역시 강서 사업소 녹산동
3rd row부산광역시 기장 사업소 기장읍
4th row부산광역시 기장 사업소 정관읍
5th row부산광역시 강서 사업소 가덕도동
ValueCountFrequency (%)
부산광역시 10000
25.2%
사업소 8788
22.1%
강서 3472
 
8.8%
기장 2003
 
5.0%
동래통합사업소 1181
 
3.0%
녹산동 868
 
2.2%
남부 628
 
1.6%
대저1동 619
 
1.6%
부산진 587
 
1.5%
대저2동 542
 
1.4%
Other values (223) 10988
27.7%
2023-12-12T13:21:36.455641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93312
39.8%
12702
 
5.4%
11753
 
5.0%
10491
 
4.5%
10444
 
4.5%
10374
 
4.4%
10041
 
4.3%
10009
 
4.3%
9969
 
4.3%
9865
 
4.2%
Other values (118) 45325
19.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 136419
58.2%
Space Separator 93312
39.8%
Decimal Number 4554
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12702
 
9.3%
11753
 
8.6%
10491
 
7.7%
10444
 
7.7%
10374
 
7.6%
10041
 
7.4%
10009
 
7.3%
9969
 
7.3%
9865
 
7.2%
3863
 
2.8%
Other values (109) 36908
27.1%
Decimal Number
ValueCountFrequency (%)
1 1907
41.9%
2 1651
36.3%
3 523
 
11.5%
4 225
 
4.9%
5 119
 
2.6%
6 94
 
2.1%
9 24
 
0.5%
8 11
 
0.2%
Space Separator
ValueCountFrequency (%)
93312
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 136419
58.2%
Common 97866
41.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12702
 
9.3%
11753
 
8.6%
10491
 
7.7%
10444
 
7.7%
10374
 
7.6%
10041
 
7.4%
10009
 
7.3%
9969
 
7.3%
9865
 
7.2%
3863
 
2.8%
Other values (109) 36908
27.1%
Common
ValueCountFrequency (%)
93312
95.3%
1 1907
 
1.9%
2 1651
 
1.7%
3 523
 
0.5%
4 225
 
0.2%
5 119
 
0.1%
6 94
 
0.1%
9 24
 
< 0.1%
8 11
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 136419
58.2%
ASCII 97866
41.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
93312
95.3%
1 1907
 
1.9%
2 1651
 
1.7%
3 523
 
0.5%
4 225
 
0.2%
5 119
 
0.1%
6 94
 
0.1%
9 24
 
< 0.1%
8 11
 
< 0.1%
Hangul
ValueCountFrequency (%)
12702
 
9.3%
11753
 
8.6%
10491
 
7.7%
10444
 
7.7%
10374
 
7.6%
10041
 
7.4%
10009
 
7.3%
9969
 
7.3%
9865
 
7.2%
3863
 
2.8%
Other values (109) 36908
27.1%

주소
Text

Distinct9646
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T13:21:36.930115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length79
Mean length29.3889
Min length6

Characters and Unicode

Total characters293889
Distinct characters672
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9436 ?
Unique (%)94.4%

Sample

1st row가락대로1303번길 63 (봉림동) /가락동 744-25
2nd row미음국제2로 110 /녹산동 10-8
3rd row기장읍 동부리 330-8 /기장읍 차성동로 94-7 (수안빌라 A동)
4th row정관읍 달산1길 49 (제일좋은교회)
5th row동선새바지길 176-5 (동선동) /동선동 151
ValueCountFrequency (%)
부산광역시 1742
 
3.4%
대저1동 1091
 
2.1%
대저2동 989
 
1.9%
명지동 717
 
1.4%
강동동 693
 
1.4%
장안읍 650
 
1.3%
기장읍 577
 
1.1%
정관읍 535
 
1.1%
녹산동 409
 
0.8%
철마면 401
 
0.8%
Other values (14869) 42943
84.6%
2023-12-12T13:21:37.526306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42745
 
14.5%
1 19209
 
6.5%
17293
 
5.9%
2 13468
 
4.6%
- 10297
 
3.5%
3 10007
 
3.4%
/ 9684
 
3.3%
) 8630
 
2.9%
( 8613
 
2.9%
4 7794
 
2.7%
Other values (662) 146149
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125205
42.6%
Decimal Number 86765
29.5%
Space Separator 42745
 
14.5%
Dash Punctuation 10300
 
3.5%
Other Punctuation 9776
 
3.3%
Close Punctuation 8662
 
2.9%
Open Punctuation 8644
 
2.9%
Uppercase Letter 1121
 
0.4%
Lowercase Letter 647
 
0.2%
Math Symbol 13
 
< 0.1%
Other values (3) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17293
 
13.8%
7557
 
6.0%
7266
 
5.8%
5392
 
4.3%
4217
 
3.4%
4156
 
3.3%
2685
 
2.1%
2494
 
2.0%
2410
 
1.9%
2248
 
1.8%
Other values (585) 69487
55.5%
Uppercase Letter
ValueCountFrequency (%)
F 236
21.1%
B 193
17.2%
D 141
12.6%
A 130
11.6%
E 106
9.5%
C 56
 
5.0%
P 51
 
4.5%
L 44
 
3.9%
T 23
 
2.1%
S 21
 
1.9%
Other values (14) 120
10.7%
Lowercase Letter
ValueCountFrequency (%)
f 220
34.0%
e 104
16.1%
o 104
16.1%
d 91
14.1%
a 41
 
6.3%
c 27
 
4.2%
b 18
 
2.8%
s 5
 
0.8%
i 5
 
0.8%
l 5
 
0.8%
Other values (9) 27
 
4.2%
Decimal Number
ValueCountFrequency (%)
1 19209
22.1%
2 13468
15.5%
3 10007
11.5%
4 7794
9.0%
5 7245
 
8.4%
6 6502
 
7.5%
7 6228
 
7.2%
0 5688
 
6.6%
8 5585
 
6.4%
9 5039
 
5.8%
Other Punctuation
ValueCountFrequency (%)
/ 9684
99.1%
: 46
 
0.5%
; 17
 
0.2%
. 11
 
0.1%
, 9
 
0.1%
& 4
 
< 0.1%
@ 3
 
< 0.1%
! 1
 
< 0.1%
# 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 11
84.6%
~ 1
 
7.7%
+ 1
 
7.7%
Other Symbol
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 10297
> 99.9%
3
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 8630
99.6%
] 32
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 8613
99.6%
[ 31
 
0.4%
Space Separator
ValueCountFrequency (%)
42745
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 166916
56.8%
Hangul 125205
42.6%
Latin 1768
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17293
 
13.8%
7557
 
6.0%
7266
 
5.8%
5392
 
4.3%
4217
 
3.4%
4156
 
3.3%
2685
 
2.1%
2494
 
2.0%
2410
 
1.9%
2248
 
1.8%
Other values (585) 69487
55.5%
Latin
ValueCountFrequency (%)
F 236
13.3%
f 220
12.4%
B 193
10.9%
D 141
 
8.0%
A 130
 
7.4%
E 106
 
6.0%
e 104
 
5.9%
o 104
 
5.9%
d 91
 
5.1%
C 56
 
3.2%
Other values (33) 387
21.9%
Common
ValueCountFrequency (%)
42745
25.6%
1 19209
11.5%
2 13468
 
8.1%
- 10297
 
6.2%
3 10007
 
6.0%
/ 9684
 
5.8%
) 8630
 
5.2%
( 8613
 
5.2%
4 7794
 
4.7%
5 7245
 
4.3%
Other values (24) 29224
17.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 168677
57.4%
Hangul 125203
42.6%
Punctuation 3
 
< 0.1%
Geometric Shapes 3
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Misc Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42745
25.3%
1 19209
11.4%
2 13468
 
8.0%
- 10297
 
6.1%
3 10007
 
5.9%
/ 9684
 
5.7%
) 8630
 
5.1%
( 8613
 
5.1%
4 7794
 
4.6%
5 7245
 
4.3%
Other values (63) 30985
18.4%
Hangul
ValueCountFrequency (%)
17293
 
13.8%
7557
 
6.0%
7266
 
5.8%
5392
 
4.3%
4217
 
3.4%
4156
 
3.3%
2685
 
2.1%
2494
 
2.0%
2410
 
1.9%
2248
 
1.8%
Other values (584) 69485
55.5%
Punctuation
ValueCountFrequency (%)
3
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
66.7%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T13:21:33.489375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T13:21:33.662265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:21:33.785133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호수용가명구역주소
3945139452장*순(신흥컨설팅)부산광역시 강서 사업소 가락동가락대로1303번길 63 (봉림동) /가락동 744-25
6377463775(*)현대테크,박명순부산광역시 강서 사업소 녹산동미음국제2로 110 /녹산동 10-8
50473504743*3호 이성우부산광역시 기장 사업소 기장읍기장읍 동부리 330-8 /기장읍 차성동로 94-7 (수안빌라 A동)
4389143892주*진부산광역시 기장 사업소 정관읍정관읍 달산1길 49 (제일좋은교회)
5823458235김*배부산광역시 강서 사업소 가덕도동동선새바지길 176-5 (동선동) /동선동 151
3438134382이*일부산광역시 기장 사업소 구역 미지정 수용가일광읍 이천3길 19-3
2575525756이*영부산광역시 강서 사업소 가락동봉죽길 482번길 292 (죽동동) /죽동동 540
6122861229김*태부산광역시 강서 사업소 명지동명지국제12로 7-3 (명지동/ 아트빌) /명지동 3503-4 아트빌
6575165752명*더샵퍼스트월드2단지 상가부산광역시 강서 사업소 명지동명지국제7로 37 (명지동/ 더샵 명지퍼스트월드 2단지 관리사무소)
2316023161성*수부산광역시 강서 사업소 대저2동부산광역시 맥도길545번길 27-1 (대저2동) /대저2동 4826
번호수용가명구역주소
6699866999(*)효경테크(삼인이엔지)부산광역시 강서 사업소 녹산동미음국제5로가길 4 (미음동)
4750647507송*희부산광역시 강서 사업소 대저1동대저로273번길 18 (대저1동) /대저1동 2388-11
3999439995강*일부산광역시 기장 사업소 기장읍기장읍 배산로68번길 11-17
6651166512최*석(농막)부산광역시 강서 사업소 가락동강서구 봉림동 828
30703071권*오부산광역시 중동부 사업소 초량6동부산광역시 초량6동 827 /초량로108번길 29 (초량동/ 경희@104)
1067210673박*환부산광역시 동래통합사업소 사직2동사직2동 92-5(샤브향) /사직북로5번길 9 (사직동)
3885838859이*순부산광역시 강서 사업소 녹산동생곡길26번길 22 (생곡동) /생곡동 354-3번지
95019502박*석(안경원고)부산광역시 동래통합사업소 수민동부산광역시 수안동 545 가정집수도 /충렬대로237번길 7-6 (수안동)
5057450575(*)동신하이드로릭부산광역시 강서 사업소 대저2동부산광역시 유통단지1로97번길 12 / (220호)(대저2동) /대저2동 3144 (220동)
2074720748삼*빌라부산광역시 동래통합사업소 남산동부산광역시 남산동 481-3 /금강로 659 (남산동/ 남산삼전빌라)