Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory498.0 KiB
Average record size in memory51.0 B

Variable types

Numeric3
Text1
DateTime1

Dataset

Description경상남도 진주시 스마트워터미터기(수도사용량 원격검침기) 설치 현황내역이며, 검침원이 직접 방문하지 않고 원격으로 수도 사용량을 확인 가능한 스마트워터미터기 주소 및 위치정보 자료입니다.
URLhttps://www.data.go.kr/data/15103321/fileData.do

Alerts

데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 위도 and 1 other fieldsHigh correlation
위도 is highly overall correlated with 연번High correlation
경도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:35:06.921257
Analysis finished2023-12-12 12:35:08.808746
Duration1.89 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11049.047
Minimum4
Maximum21994
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:35:08.891839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile1131.8
Q15574.75
median11096.5
Q316455.25
95-th percentile20911.1
Maximum21994
Range21990
Interquartile range (IQR)10880.5

Descriptive statistics

Standard deviation6341.9924
Coefficient of variation (CV)0.57398549
Kurtosis-1.1958669
Mean11049.047
Median Absolute Deviation (MAD)5452.5
Skewness-0.0090332413
Sum1.1049047 × 108
Variance40220868
MonotonicityNot monotonic
2023-12-12T21:35:09.034033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15220 1
 
< 0.1%
4699 1
 
< 0.1%
13550 1
 
< 0.1%
17755 1
 
< 0.1%
2645 1
 
< 0.1%
19779 1
 
< 0.1%
18108 1
 
< 0.1%
11872 1
 
< 0.1%
2015 1
 
< 0.1%
17238 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
4 1
< 0.1%
6 1
< 0.1%
9 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
18 1
< 0.1%
20 1
< 0.1%
21 1
< 0.1%
24 1
< 0.1%
27 1
< 0.1%
ValueCountFrequency (%)
21994 1
< 0.1%
21993 1
< 0.1%
21992 1
< 0.1%
21990 1
< 0.1%
21989 1
< 0.1%
21988 1
< 0.1%
21987 1
< 0.1%
21984 1
< 0.1%
21982 1
< 0.1%
21980 1
< 0.1%
Distinct9851
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:35:09.308667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length47
Mean length26.5053
Min length14

Characters and Unicode

Total characters265053
Distinct characters530
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9707 ?
Unique (%)97.1%

Sample

1st row경상남도 진주시 망경북길 25-10, (4/3) (망경동)
2nd row경상남도 진주시 이반성면 장안로54번길 21
3rd row경상남도 진주시 상봉대룡길5번길 5 (상봉동)
4th row경상남도 진주시 정촌면 진주대로130번길 19-7 (예하리)
5th row경상남도 진주시 사봉면 모곡길 76-28 (봉곡리)
ValueCountFrequency (%)
진주시 10010
 
18.7%
경상남도 10001
 
18.7%
문산읍 1067
 
2.0%
상평동 661
 
1.2%
금곡면 640
 
1.2%
진성면 634
 
1.2%
정촌면 618
 
1.2%
수곡면 607
 
1.1%
이반성면 568
 
1.1%
일반성면 558
 
1.0%
Other values (5593) 28038
52.5%
2023-12-12T21:35:09.741401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43898
 
16.6%
11727
 
4.4%
11062
 
4.2%
11042
 
4.2%
10983
 
4.1%
10804
 
4.1%
10089
 
3.8%
10081
 
3.8%
1 9707
 
3.7%
8822
 
3.3%
Other values (520) 126838
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 157077
59.3%
Decimal Number 44973
 
17.0%
Space Separator 43898
 
16.6%
Open Punctuation 6483
 
2.4%
Close Punctuation 6471
 
2.4%
Dash Punctuation 4937
 
1.9%
Other Punctuation 1052
 
0.4%
Uppercase Letter 156
 
0.1%
Lowercase Letter 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11727
 
7.5%
11062
 
7.0%
11042
 
7.0%
10983
 
7.0%
10804
 
6.9%
10089
 
6.4%
10081
 
6.4%
8822
 
5.6%
6954
 
4.4%
6081
 
3.9%
Other values (476) 59432
37.8%
Uppercase Letter
ValueCountFrequency (%)
C 28
17.9%
E 20
12.8%
T 18
11.5%
I 18
11.5%
H 18
11.5%
S 13
8.3%
B 8
 
5.1%
M 6
 
3.8%
L 6
 
3.8%
A 5
 
3.2%
Other values (8) 16
10.3%
Decimal Number
ValueCountFrequency (%)
1 9707
21.6%
2 5886
13.1%
3 4731
10.5%
5 4338
9.6%
4 3893
8.7%
9 3753
 
8.3%
6 3584
 
8.0%
7 3115
 
6.9%
8 3007
 
6.7%
0 2959
 
6.6%
Other Punctuation
ValueCountFrequency (%)
, 648
61.6%
/ 356
33.8%
. 41
 
3.9%
: 5
 
0.5%
& 2
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
t 1
25.0%
c 1
25.0%
e 1
25.0%
i 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 6403
98.8%
[ 80
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 6391
98.8%
] 80
 
1.2%
Space Separator
ValueCountFrequency (%)
43898
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4937
100.0%
Math Symbol
ValueCountFrequency (%)
> 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 157077
59.3%
Common 107816
40.7%
Latin 160
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11727
 
7.5%
11062
 
7.0%
11042
 
7.0%
10983
 
7.0%
10804
 
6.9%
10089
 
6.4%
10081
 
6.4%
8822
 
5.6%
6954
 
4.4%
6081
 
3.9%
Other values (476) 59432
37.8%
Common
ValueCountFrequency (%)
43898
40.7%
1 9707
 
9.0%
( 6403
 
5.9%
) 6391
 
5.9%
2 5886
 
5.5%
- 4937
 
4.6%
3 4731
 
4.4%
5 4338
 
4.0%
4 3893
 
3.6%
9 3753
 
3.5%
Other values (12) 13879
 
12.9%
Latin
ValueCountFrequency (%)
C 28
17.5%
E 20
12.5%
T 18
11.2%
I 18
11.2%
H 18
11.2%
S 13
8.1%
B 8
 
5.0%
M 6
 
3.8%
L 6
 
3.8%
A 5
 
3.1%
Other values (12) 20
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 157077
59.3%
ASCII 107976
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43898
40.7%
1 9707
 
9.0%
( 6403
 
5.9%
) 6391
 
5.9%
2 5886
 
5.5%
- 4937
 
4.6%
3 4731
 
4.4%
5 4338
 
4.0%
4 3893
 
3.6%
9 3753
 
3.5%
Other values (34) 14039
 
13.0%
Hangul
ValueCountFrequency (%)
11727
 
7.5%
11062
 
7.0%
11042
 
7.0%
10983
 
7.0%
10804
 
6.9%
10089
 
6.4%
10081
 
6.4%
8822
 
5.6%
6954
 
4.4%
6081
 
3.9%
Other values (476) 59432
37.8%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct9659
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.643973
Minimum35.06946
Maximum128.35496
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:35:09.883331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.06946
5-th percentile35.118006
Q135.165519
median35.176041
Q335.187385
95-th percentile35.28228
Maximum128.35496
Range93.285497
Interquartile range (IQR)0.02186665

Descriptive statistics

Standard deviation19.888236
Coefficient of variation (CV)0.50167112
Kurtosis15.849012
Mean39.643973
Median Absolute Deviation (MAD)0.011145585
Skewness4.2244268
Sum396439.73
Variance395.54194
MonotonicityNot monotonic
2023-12-12T21:35:10.018526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.1312427 128
 
1.3%
35.20021725 10
 
0.1%
35.17087608 6
 
0.1%
35.19823337 5
 
0.1%
35.12876344 4
 
< 0.1%
35.17311381 4
 
< 0.1%
35.17324872 4
 
< 0.1%
35.16534348 3
 
< 0.1%
35.18241439 3
 
< 0.1%
35.1943972 3
 
< 0.1%
Other values (9649) 9830
98.3%
ValueCountFrequency (%)
35.06946004 1
< 0.1%
35.06946074 1
< 0.1%
35.06948872 1
< 0.1%
35.06954157 1
< 0.1%
35.06960628 1
< 0.1%
35.06979537 1
< 0.1%
35.06983104 1
< 0.1%
35.06994799 1
< 0.1%
35.07000299 1
< 0.1%
35.07002953 1
< 0.1%
ValueCountFrequency (%)
128.3549568 1
< 0.1%
128.3457674 1
< 0.1%
128.3070266 1
< 0.1%
128.3017215 1
< 0.1%
128.2958407 1
< 0.1%
128.2837373 1
< 0.1%
128.2741251 1
< 0.1%
128.2705131 1
< 0.1%
128.2683201 1
< 0.1%
128.2673373 1
< 0.1%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct9648
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean123.67729
Minimum35.072884
Maximum128.35643
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:35:10.177379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.072884
5-th percentile127.89541
Q1128.08522
median128.12249
Q3128.22183
95-th percentile128.31386
Maximum128.35643
Range93.283551
Interquartile range (IQR)0.13661733

Descriptive statistics

Standard deviation19.891245
Coefficient of variation (CV)0.16083182
Kurtosis15.848163
Mean123.67729
Median Absolute Deviation (MAD)0.05642135
Skewness-4.2242696
Sum1236772.9
Variance395.66162
MonotonicityNot monotonic
2023-12-12T21:35:10.315406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.20021725 128
 
1.3%
128.1312427 10
 
0.1%
128.2827601 6
 
0.1%
127.9301279 5
 
0.1%
128.099438 4
 
< 0.1%
128.100349 4
 
< 0.1%
128.1007115 4
 
< 0.1%
128.1107578 3
 
< 0.1%
127.929784 3
 
< 0.1%
127.9375325 3
 
< 0.1%
Other values (9638) 9830
98.3%
ValueCountFrequency (%)
35.07288361 1
< 0.1%
35.11205758 1
< 0.1%
35.11228604 1
< 0.1%
35.11505725 1
< 0.1%
35.12039369 1
< 0.1%
35.12260504 1
< 0.1%
35.12493292 1
< 0.1%
35.13010464 1
< 0.1%
35.1328535 1
< 0.1%
35.13520462 1
< 0.1%
ValueCountFrequency (%)
128.3564346 1
< 0.1%
128.3558147 1
< 0.1%
128.3557947 1
< 0.1%
128.3557713 1
< 0.1%
128.3557497 1
< 0.1%
128.3556851 1
< 0.1%
128.3556595 1
< 0.1%
128.3555644 1
< 0.1%
128.3555074 1
< 0.1%
128.3553296 1
< 0.1%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-08-30 00:00:00
Maximum2023-08-30 00:00:00
2023-12-12T21:35:10.495573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:10.585685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T21:35:08.311095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:07.709737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:08.042734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:08.412048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:07.819620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:08.131728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:08.518831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:07.935669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:35:08.225931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:35:10.650495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도
연번1.0000.8340.834
위도0.8341.0001.000
경도0.8341.0001.000
2023-12-12T21:35:10.735602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도
연번1.0000.519-0.548
위도0.5191.000-0.298
경도-0.548-0.2981.000

Missing values

2023-12-12T21:35:08.652327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:35:08.764908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번도로명주소위도경도데이터기준일
1521915220경상남도 진주시 망경북길 25-10, (4/3) (망경동)35.185559128.0813472023-08-30
78117812경상남도 진주시 이반성면 장안로54번길 2135.188443128.3395472023-08-30
2171921647경상남도 진주시 상봉대룡길5번길 5 (상봉동)128.06834835.200092023-08-30
33863387경상남도 진주시 정촌면 진주대로130번길 19-7 (예하리)35.120337128.0966962023-08-30
98999900경상남도 진주시 사봉면 모곡길 76-28 (봉곡리)35.196814128.2964022023-08-30
2023520236경상남도 진주시 영천강로68번길 12 (충무공동, 미드테이블 진주점)35.169371128.1379552023-08-30
77907791경상남도 진주시 이반성면 장안로54번길 2435.188775128.3399112023-08-30
1473814739경상남도 진주시 진주대로829번길 635.174779128.0925672023-08-30
1721717218경상남도 진주시 남강로1093번길 8, 진진복집 (상평동)35.168903128.1116952023-08-30
20802081경상남도 진주시 문산읍 동부로302번길 5-16 (옥산리)35.161864128.1365262023-08-30
연번도로명주소위도경도데이터기준일
1529415295경상남도 진주시 망경로 25735.184288128.0815652023-08-30
1258212583경상남도 진주시 수곡면 중전길101번길 235.187609127.9577972023-08-30
48944895경상남도 진주시 금곡면 구암두문로 1016-1 (두문리)35.090259128.1860882023-08-30
1837318374경상남도 진주시 돗골로117번길 33-1 (상평동) 401호35.178407128.1086892023-08-30
1213512136경상남도 진주시 수곡면 사곡로213번길 4835.217331127.9106252023-08-30
54485449경상남도 진주시 진성면 동부로1345번길 24-1135.187841128.2225252023-08-30
1986719868경상남도 진주시 판문오동길 45-4 (판문동)35.171779128.0412952023-08-30
192193경상남도 진주시 문산읍 동부로709번길 14 (상문리)35.172037128.1749572023-08-30
1842218423경상남도 진주시 솔밭로80번길 11-1 (상평동)35.176487128.1082142023-08-30
2146121462경상남도 진주시 대곡면 와룡리 279-1128.18300635.2701822023-08-30