Overview

Dataset statistics

Number of variables3
Number of observations3770
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory92.2 KiB
Average record size in memory25.0 B

Variable types

Numeric1
Text1
Categorical1

Dataset

Description현재 포항시 내 스마트워터미터기(수도 원격검침) 설치 완료 및 활용되고 있는 주소지", "수도 원격 검침은 검침원이 직접 방문하지 않고 원격으로 수도 사용량을 확인이 가능함
Author경상북도 포항시
URLhttps://www.data.go.kr/data/15103132/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:51:24.613227
Analysis finished2023-12-12 23:51:25.177491
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct3770
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1885.5
Minimum1
Maximum3770
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size33.3 KiB
2023-12-13T08:51:25.261324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile189.45
Q1943.25
median1885.5
Q32827.75
95-th percentile3581.55
Maximum3770
Range3769
Interquartile range (IQR)1884.5

Descriptive statistics

Standard deviation1088.4496
Coefficient of variation (CV)0.57727371
Kurtosis-1.2
Mean1885.5
Median Absolute Deviation (MAD)942.5
Skewness0
Sum7108335
Variance1184722.5
MonotonicityStrictly increasing
2023-12-13T08:51:25.418159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2519 1
 
< 0.1%
2507 1
 
< 0.1%
2508 1
 
< 0.1%
2509 1
 
< 0.1%
2510 1
 
< 0.1%
2511 1
 
< 0.1%
2512 1
 
< 0.1%
2513 1
 
< 0.1%
2514 1
 
< 0.1%
Other values (3760) 3760
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3770 1
< 0.1%
3769 1
< 0.1%
3768 1
< 0.1%
3767 1
< 0.1%
3766 1
< 0.1%
3765 1
< 0.1%
3764 1
< 0.1%
3763 1
< 0.1%
3762 1
< 0.1%
3761 1
< 0.1%
Distinct3742
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size29.6 KiB
2023-12-13T08:51:25.786026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length52
Mean length25.388064
Min length18

Characters and Unicode

Total characters95713
Distinct characters550
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3716 ?
Unique (%)98.6%

Sample

1st row경상북도 포항시 북구 흥해읍 마산1리 147-12
2nd row경상북도 포항시 북구 흥해읍 마산리 39-1
3rd row경상북도 포항시 북구 흥해읍 마산1리 88
4th row경상북도 포항시 북구 흥해읍 마산1리 264
5th row경상북도 포항시 북구 흥해읍 마산리 140
ValueCountFrequency (%)
포항시 3798
17.2%
경상북도 3793
17.2%
북구 2797
 
12.7%
남구 1001
 
4.5%
흥해읍 575
 
2.6%
죽도1등 542
 
2.5%
기계면 525
 
2.4%
대송면 174
 
0.8%
화대리 155
 
0.7%
오천읍 124
 
0.6%
Other values (4409) 8561
38.8%
2023-12-13T08:51:26.418564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18289
19.1%
6658
 
7.0%
4854
 
5.1%
4084
 
4.3%
4025
 
4.2%
3977
 
4.2%
1 3934
 
4.1%
3838
 
4.0%
3816
 
4.0%
3812
 
4.0%
Other values (540) 38426
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55842
58.3%
Space Separator 18289
 
19.1%
Decimal Number 16651
 
17.4%
Dash Punctuation 2835
 
3.0%
Math Symbol 1271
 
1.3%
Open Punctuation 317
 
0.3%
Close Punctuation 317
 
0.3%
Other Punctuation 88
 
0.1%
Uppercase Letter 83
 
0.1%
Lowercase Letter 20
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6658
 
11.9%
4854
 
8.7%
4084
 
7.3%
4025
 
7.2%
3977
 
7.1%
3838
 
6.9%
3816
 
6.8%
3812
 
6.8%
1904
 
3.4%
1588
 
2.8%
Other values (488) 17286
31.0%
Uppercase Letter
ValueCountFrequency (%)
A 16
19.3%
B 14
16.9%
L 12
14.5%
G 9
10.8%
P 7
8.4%
T 5
 
6.0%
C 4
 
4.8%
S 4
 
4.8%
M 2
 
2.4%
Y 2
 
2.4%
Other values (7) 8
9.6%
Lowercase Letter
ValueCountFrequency (%)
e 3
15.0%
u 3
15.0%
r 3
15.0%
o 2
10.0%
t 2
10.0%
l 1
 
5.0%
f 1
 
5.0%
c 1
 
5.0%
a 1
 
5.0%
m 1
 
5.0%
Other values (2) 2
10.0%
Decimal Number
ValueCountFrequency (%)
1 3934
23.6%
2 2558
15.4%
3 1631
9.8%
4 1626
9.8%
5 1581
9.5%
6 1253
 
7.5%
7 1089
 
6.5%
9 1045
 
6.3%
8 993
 
6.0%
0 941
 
5.7%
Other Punctuation
ValueCountFrequency (%)
. 57
64.8%
, 23
26.1%
: 6
 
6.8%
& 1
 
1.1%
/ 1
 
1.1%
Math Symbol
ValueCountFrequency (%)
> 637
50.1%
< 634
49.9%
Open Punctuation
ValueCountFrequency (%)
( 316
99.7%
[ 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 316
99.7%
] 1
 
0.3%
Space Separator
ValueCountFrequency (%)
18289
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2835
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55841
58.3%
Common 39768
41.5%
Latin 103
 
0.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6658
 
11.9%
4854
 
8.7%
4084
 
7.3%
4025
 
7.2%
3977
 
7.1%
3838
 
6.9%
3816
 
6.8%
3812
 
6.8%
1904
 
3.4%
1588
 
2.8%
Other values (487) 17285
31.0%
Latin
ValueCountFrequency (%)
A 16
15.5%
B 14
13.6%
L 12
11.7%
G 9
 
8.7%
P 7
 
6.8%
T 5
 
4.9%
C 4
 
3.9%
S 4
 
3.9%
e 3
 
2.9%
u 3
 
2.9%
Other values (19) 26
25.2%
Common
ValueCountFrequency (%)
18289
46.0%
1 3934
 
9.9%
- 2835
 
7.1%
2 2558
 
6.4%
3 1631
 
4.1%
4 1626
 
4.1%
5 1581
 
4.0%
6 1253
 
3.2%
7 1089
 
2.7%
9 1045
 
2.6%
Other values (13) 3927
 
9.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55841
58.3%
ASCII 39871
41.7%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18289
45.9%
1 3934
 
9.9%
- 2835
 
7.1%
2 2558
 
6.4%
3 1631
 
4.1%
4 1626
 
4.1%
5 1581
 
4.0%
6 1253
 
3.1%
7 1089
 
2.7%
9 1045
 
2.6%
Other values (42) 4030
 
10.1%
Hangul
ValueCountFrequency (%)
6658
 
11.9%
4854
 
8.7%
4084
 
7.3%
4025
 
7.2%
3977
 
7.1%
3838
 
6.9%
3816
 
6.8%
3812
 
6.8%
1904
 
3.4%
1588
 
2.8%
Other values (487) 17285
31.0%
CJK
ValueCountFrequency (%)
1
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size29.6 KiB
2022-08-03
3770 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-08-03
2nd row2022-08-03
3rd row2022-08-03
4th row2022-08-03
5th row2022-08-03

Common Values

ValueCountFrequency (%)
2022-08-03 3770
100.0%

Length

2023-12-13T08:51:26.576411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:51:26.675419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-08-03 3770
100.0%

Interactions

2023-12-13T08:51:24.945655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T08:51:25.070347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:51:25.141639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번설치주소데이터기준일자
01경상북도 포항시 북구 흥해읍 마산1리 147-122022-08-03
12경상북도 포항시 북구 흥해읍 마산리 39-12022-08-03
23경상북도 포항시 북구 흥해읍 마산1리 882022-08-03
34경상북도 포항시 북구 흥해읍 마산1리 2642022-08-03
45경상북도 포항시 북구 흥해읍 마산리 1402022-08-03
56경상북도 포항시 북구 흥해읍 마산2리 29-212022-08-03
67경상북도 포항시 북구 흥해읍 마산2리 29-38 <총무 207호>2022-08-03
78경상북도 포항시 북구 흥해읍 옥성리 155-12022-08-03
89경상북도 포항시 북구 흥해읍 옥성2리 248-17 (303호)2022-08-03
910경상북도 포항시 북구 흥해읍 약성리 2232022-08-03
순번설치주소데이터기준일자
37603761경상북도 포항시 남구 대잠동 998-142022-08-03
37613762경상북도 포항시 남구 이동 6452022-08-03
37623763경상북도 포항시 남구 이동 647-16 <경상북도 포항시 남구 이동시티캐슬 A>2022-08-03
37633764경상북도 포항시 남구 대잠동 468-5 <제일회타운>2022-08-03
37643765경상북도 포항시 남구 이동 54-8 <태경주유소>2022-08-03
37653766경상북도 포항시 남구 대잠동 940-82022-08-03
37663767경상북도 포항시 남구 대잠동 946-13 <401호>2022-08-03
37673768경상북도 포항시 남구 대잠동 938-2 <복권방>2022-08-03
37683769경상북도 포항시 남구 대잠동 919-19 (미니스톱)2022-08-03
37693770경상북도 포항시 남구 대잠동 994-15 <다인빌딩> (공동)2022-08-03