Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory498.0 KiB
Average record size in memory51.0 B

Variable types

Numeric3
Text1
Categorical1

Dataset

Description경상남도 진주시 스마트워터미터기(수도사용량 원격검침기) 설치 현황내역이며, 검침원이 직접 방문하지 않고 원격으로 수도 사용량을 확인 가능한 스마트워터미터기 주소 및 위치정보 자료입니다.
Author경상남도 진주시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15103321

Alerts

데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 위도 and 1 other fieldsHigh correlation
위도 is highly overall correlated with 연번High correlation
경도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:52:47.519037
Analysis finished2023-12-11 00:52:49.028411
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10987.61
Minimum1
Maximum21994
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T09:52:49.088285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1060.95
Q15462.75
median10980
Q316533
95-th percentile20854.1
Maximum21994
Range21993
Interquartile range (IQR)11070.25

Descriptive statistics

Standard deviation6362.1105
Coefficient of variation (CV)0.57902589
Kurtosis-1.209824
Mean10987.61
Median Absolute Deviation (MAD)5535
Skewness-0.0073156353
Sum1.098761 × 108
Variance40476450
MonotonicityNot monotonic
2023-12-11T09:52:49.198348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4868 1
 
< 0.1%
17135 1
 
< 0.1%
1773 1
 
< 0.1%
6116 1
 
< 0.1%
8725 1
 
< 0.1%
866 1
 
< 0.1%
4543 1
 
< 0.1%
17937 1
 
< 0.1%
10888 1
 
< 0.1%
4392 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
6 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
21994 1
< 0.1%
21992 1
< 0.1%
21990 1
< 0.1%
21988 1
< 0.1%
21985 1
< 0.1%
21984 1
< 0.1%
21982 1
< 0.1%
21979 1
< 0.1%
21977 1
< 0.1%
21976 1
< 0.1%
Distinct9862
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T09:52:49.472934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length46
Mean length26.5346
Min length14

Characters and Unicode

Total characters265346
Distinct characters524
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9728 ?
Unique (%)97.3%

Sample

1st row경상남도 진주시 금곡면 월아산로 94 (두문리)
2nd row경상남도 진주시 진주대로 980 (강남동)
3rd row경상남도 진주시 지수면 지수로 449번길 28-1
4th row경상남도 진주시 이반성면 오봉산로554번길 15-1
5th row경상남도 진주시 지수면 청원길 183 (지철마을)
ValueCountFrequency (%)
진주시 10011
 
18.7%
경상남도 10002
 
18.7%
문산읍 1114
 
2.1%
상평동 686
 
1.3%
정촌면 656
 
1.2%
금곡면 628
 
1.2%
수곡면 609
 
1.1%
일반성면 596
 
1.1%
진성면 570
 
1.1%
이반성면 558
 
1.0%
Other values (5590) 28082
52.5%
2023-12-11T09:52:49.852566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43984
 
16.6%
11605
 
4.4%
11096
 
4.2%
11054
 
4.2%
10994
 
4.1%
10774
 
4.1%
10090
 
3.8%
10088
 
3.8%
1 9855
 
3.7%
8838
 
3.3%
Other values (514) 126968
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 157422
59.3%
Decimal Number 44772
 
16.9%
Space Separator 43984
 
16.6%
Open Punctuation 6552
 
2.5%
Close Punctuation 6543
 
2.5%
Dash Punctuation 4934
 
1.9%
Other Punctuation 997
 
0.4%
Uppercase Letter 137
 
0.1%
Lowercase Letter 4
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11605
 
7.4%
11096
 
7.0%
11054
 
7.0%
10994
 
7.0%
10774
 
6.8%
10090
 
6.4%
10088
 
6.4%
8838
 
5.6%
6914
 
4.4%
6065
 
3.9%
Other values (469) 59904
38.1%
Uppercase Letter
ValueCountFrequency (%)
C 19
13.9%
I 18
13.1%
T 18
13.1%
E 17
12.4%
H 17
12.4%
B 11
8.0%
A 7
 
5.1%
L 6
 
4.4%
K 5
 
3.6%
S 3
 
2.2%
Other values (10) 16
11.7%
Decimal Number
ValueCountFrequency (%)
1 9855
22.0%
2 5810
13.0%
3 4645
10.4%
5 4264
9.5%
4 3863
 
8.6%
9 3725
 
8.3%
6 3518
 
7.9%
7 3235
 
7.2%
0 2940
 
6.6%
8 2917
 
6.5%
Other Punctuation
ValueCountFrequency (%)
, 619
62.1%
/ 343
34.4%
. 31
 
3.1%
: 4
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
i 1
25.0%
t 1
25.0%
e 1
25.0%
c 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 6472
98.8%
[ 80
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 6463
98.8%
] 80
 
1.2%
Space Separator
ValueCountFrequency (%)
43984
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4934
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 157422
59.3%
Common 107783
40.6%
Latin 141
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11605
 
7.4%
11096
 
7.0%
11054
 
7.0%
10994
 
7.0%
10774
 
6.8%
10090
 
6.4%
10088
 
6.4%
8838
 
5.6%
6914
 
4.4%
6065
 
3.9%
Other values (469) 59904
38.1%
Latin
ValueCountFrequency (%)
C 19
13.5%
I 18
12.8%
T 18
12.8%
E 17
12.1%
H 17
12.1%
B 11
7.8%
A 7
 
5.0%
L 6
 
4.3%
K 5
 
3.5%
S 3
 
2.1%
Other values (14) 20
14.2%
Common
ValueCountFrequency (%)
43984
40.8%
1 9855
 
9.1%
( 6472
 
6.0%
) 6463
 
6.0%
2 5810
 
5.4%
- 4934
 
4.6%
3 4645
 
4.3%
5 4264
 
4.0%
4 3863
 
3.6%
9 3725
 
3.5%
Other values (11) 13768
 
12.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 157422
59.3%
ASCII 107924
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43984
40.8%
1 9855
 
9.1%
( 6472
 
6.0%
) 6463
 
6.0%
2 5810
 
5.4%
- 4934
 
4.6%
3 4645
 
4.3%
5 4264
 
4.0%
4 3863
 
3.6%
9 3725
 
3.5%
Other values (35) 13909
 
12.9%
Hangul
ValueCountFrequency (%)
11605
 
7.4%
11096
 
7.0%
11054
 
7.0%
10994
 
7.0%
10774
 
6.8%
10090
 
6.4%
10088
 
6.4%
8838
 
5.6%
6914
 
4.4%
6065
 
3.9%
Other values (469) 59904
38.1%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct9677
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.262721
Minimum35.06928
Maximum128.34577
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T09:52:49.967099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.06928
5-th percentile35.117682
Q135.165204
median35.175906
Q335.187289
95-th percentile35.25413
Maximum128.34577
Range93.276487
Interquartile range (IQR)0.022084415

Descriptive statistics

Standard deviation19.062921
Coefficient of variation (CV)0.48552216
Kurtosis17.782706
Mean39.262721
Median Absolute Deviation (MAD)0.01121803
Skewness4.4473679
Sum392627.21
Variance363.39497
MonotonicityNot monotonic
2023-12-11T09:52:50.072080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.1312427 115
 
1.1%
35.20021725 10
 
0.1%
35.17087608 7
 
0.1%
35.20691092 4
 
< 0.1%
35.1734892 3
 
< 0.1%
35.10877207 3
 
< 0.1%
35.12876344 3
 
< 0.1%
35.19823337 3
 
< 0.1%
35.15936154 3
 
< 0.1%
35.23192817 3
 
< 0.1%
Other values (9667) 9846
98.5%
ValueCountFrequency (%)
35.06927991 1
< 0.1%
35.06946074 1
< 0.1%
35.06948872 1
< 0.1%
35.06954157 1
< 0.1%
35.06959679 1
< 0.1%
35.06960628 1
< 0.1%
35.06966806 1
< 0.1%
35.06986563 1
< 0.1%
35.07004596 1
< 0.1%
35.07008321 1
< 0.1%
ValueCountFrequency (%)
128.3457674 1
< 0.1%
128.3136788 1
< 0.1%
128.2861872 1
< 0.1%
128.2741251 1
< 0.1%
128.2705131 1
< 0.1%
128.2683201 1
< 0.1%
128.2674794 1
< 0.1%
128.2672585 1
< 0.1%
128.2672329 1
< 0.1%
128.2672087 1
< 0.1%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct9669
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124.05877
Minimum35.089226
Maximum128.35643
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T09:52:50.178372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.089226
5-th percentile127.89931
Q1128.08557
median128.12359
Q3128.22193
95-th percentile128.31279
Maximum128.35643
Range93.267209
Interquartile range (IQR)0.13635795

Descriptive statistics

Standard deviation19.065805
Coefficient of variation (CV)0.15368365
Kurtosis17.78169
Mean124.05877
Median Absolute Deviation (MAD)0.0523001
Skewness-4.4471894
Sum1240587.7
Variance363.50492
MonotonicityNot monotonic
2023-12-11T09:52:50.287246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.20021725 115
 
1.1%
128.1312427 10
 
0.1%
128.2827601 7
 
0.1%
127.8993131 4
 
< 0.1%
128.1107578 3
 
< 0.1%
128.2765923 3
 
< 0.1%
128.099438 3
 
< 0.1%
127.9447045 3
 
< 0.1%
127.9301279 3
 
< 0.1%
128.0951352 3
 
< 0.1%
Other values (9659) 9846
98.5%
ValueCountFrequency (%)
35.08922599 1
< 0.1%
35.11010403 1
< 0.1%
35.11205758 1
< 0.1%
35.11505725 1
< 0.1%
35.12229774 1
< 0.1%
35.12260504 1
< 0.1%
35.12493292 1
< 0.1%
35.13010464 1
< 0.1%
35.13162703 1
< 0.1%
35.13164187 1
< 0.1%
ValueCountFrequency (%)
128.3564346 1
< 0.1%
128.3558147 1
< 0.1%
128.3557947 1
< 0.1%
128.3557713 1
< 0.1%
128.3557497 1
< 0.1%
128.3556851 1
< 0.1%
128.3555074 1
< 0.1%
128.355373 1
< 0.1%
128.3553709 1
< 0.1%
128.3553657 1
< 0.1%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-08-30
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-30
2nd row2023-08-30
3rd row2023-08-30
4th row2023-08-30
5th row2023-08-30

Common Values

ValueCountFrequency (%)
2023-08-30 10000
100.0%

Length

2023-12-11T09:52:50.409212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:52:50.485270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-30 10000
100.0%

Interactions

2023-12-11T09:52:48.454626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.048603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.253184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.759661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.117739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.319308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.829219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.187164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:48.385735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:52:50.530492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도
연번1.0000.8160.816
위도0.8161.0001.000
경도0.8161.0001.000
2023-12-11T09:52:50.601648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도
연번1.0000.516-0.536
위도0.5161.000-0.288
경도-0.536-0.2881.000

Missing values

2023-12-11T09:52:48.916349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:52:48.991690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번도로명주소위도경도데이터기준일
48674868경상남도 진주시 금곡면 월아산로 94 (두문리)35.090131128.187022023-08-30
1623416235경상남도 진주시 진주대로 980 (강남동)35.187421128.0871352023-08-30
1028410285경상남도 진주시 지수면 지수로 449번길 28-135.231662128.2659762023-08-30
88048805경상남도 진주시 이반성면 오봉산로554번길 15-135.157626128.3184312023-08-30
1042010421경상남도 진주시 지수면 청원길 183 (지철마을)35.227063128.2940732023-08-30
1055210553경상남도 진주시 지수면 지사로 650 (금곡리)35.223389128.243662023-08-30
58975898경상남도 진주시 진성면 동부로1602번길 98(축사)35.180268128.2507332023-08-30
1656816569경상남도 진주시 진주대로 894-1835.180576128.0911252023-08-30
38243825경상남도 진주시 금곡면 검암길 73 엄정마을 (검암리)35.116599128.1863462023-08-30
2088420885경상남도 진주시 대밭골로60번길 4-5 (충무공동)35.183023128.1501662023-08-30
연번도로명주소위도경도데이터기준일
81048105경상남도 진주시 이반성면 오봉산로1025번길 52-935.19268128.333022023-08-30
19511952경상남도 진주시 문산읍 문산로 586 (갈곡리)35.143631128.219772023-08-30
1919919200경상남도 진주시 대신로195번길 21-135.17749128.1152972023-08-30
32263227경상남도 진주시 정촌면 강주길66번길 6-4 (예하리)35.113196128.0987062023-08-30
80168017경상남도 진주시 이반성면 발산길48번길 1135.139677128.3451812023-08-30
1871718718경상남도 진주시 공단로 95-135.17553128.1103872023-08-30
47724773경상남도 진주시 금곡면 월아산로 121 (두문리)35.092763128.1869712023-08-30
95709571경상남도 진주시 사봉면 지사로 1035.187671128.262562023-08-30
35073508경상남도 진주시 정촌면 삼일로72번길 18-4 [드림빌] (예하리)35.125806128.099722023-08-30
40284029경상남도 진주시 금곡면 검암길 23 (검암리)35.114793128.1818652023-08-30