Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 462 |
Missing cells | 13 |
Missing cells (%) | 0.5% |
Duplicate rows | 20 |
Duplicate rows (%) | 4.3% |
Total size in memory | 22.2 KiB |
Average record size in memory | 49.3 B |
Variable types
Text | 2 |
---|---|
DateTime | 1 |
Categorical | 2 |
Numeric | 1 |
Dataset
Description | 전라남도 무안군 공간정보시스템에 등록된 전산화된 상수도 맨홀 정보(도엽번호, 설치일자, 규격, 맨홀종류, 맨홀형태, 법정동 등)를 제공 합니다. |
---|---|
URL | https://www.data.go.kr/data/15041000/fileData.do |
Dataset has 20 (4.3%) duplicate rows | Duplicates |
맨홀형태 is highly imbalanced (68.4%) | Imbalance |
규격 has 13 (2.8%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 21:58:39.414536 |
---|---|
Analysis finished | 2023-12-12 21:58:39.850280 |
Duration | 0.44 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
도엽번호
Text
Distinct | 280 |
---|---|
Distinct (%) | 60.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.7 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 4620 |
---|---|
Distinct characters | 14 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 200 ? |
---|---|
Unique (%) | 43.3% |
Sample
1st row | 346020503A |
---|---|
2nd row | 346020503D |
3rd row | 346020503C |
4th row | 356142594D |
5th row | 356142595C |
Value | Count | Frequency (%) |
346022097a | 13 | 2.8% |
346022087d | 11 | 2.4% |
346020512c | 10 | 2.2% |
346022097b | 9 | 1.9% |
346022096d | 9 | 1.9% |
346022096b | 8 | 1.7% |
346020526b | 7 | 1.5% |
346022097d | 6 | 1.3% |
346022097c | 6 | 1.3% |
346022088c | 6 | 1.3% |
Other values (270) | 377 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 741 | |
2 | 673 | |
3 | 620 | |
6 | 585 | |
4 | 560 | |
1 | 288 | 6.2% |
5 | 248 | 5.4% |
7 | 174 | 3.8% |
8 | 143 | 3.1% |
9 | 126 | 2.7% |
Other values (4) | 462 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 4158 | |
Uppercase Letter | 462 | 10.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 741 | |
2 | 673 | |
3 | 620 | |
6 | 585 | |
4 | 560 | |
1 | 288 | 6.9% |
5 | 248 | 6.0% |
7 | 174 | 4.2% |
8 | 143 | 3.4% |
9 | 126 | 3.0% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 118 | |
D | 116 | |
C | 115 | |
B | 113 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4158 | |
Latin | 462 | 10.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 741 | |
2 | 673 | |
3 | 620 | |
6 | 585 | |
4 | 560 | |
1 | 288 | 6.9% |
5 | 248 | 6.0% |
7 | 174 | 4.2% |
8 | 143 | 3.4% |
9 | 126 | 3.0% |
Latin
Value | Count | Frequency (%) |
A | 118 | |
D | 116 | |
C | 115 | |
B | 113 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4620 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 741 | |
2 | 673 | |
3 | 620 | |
6 | 585 | |
4 | 560 | |
1 | 288 | 6.2% |
5 | 248 | 5.4% |
7 | 174 | 3.8% |
8 | 143 | 3.1% |
9 | 126 | 2.7% |
Other values (4) | 462 |
설치일자
Date
Distinct | 19 |
---|---|
Distinct (%) | 4.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.7 KiB |
Minimum | 1900-01-01 00:00:00 |
---|---|
Maximum | 2020-01-01 00:00:00 |
규격
Text
MISSING
 
Distinct | 264 |
---|---|
Distinct (%) | 58.8% |
Missing | 13 |
Missing (%) | 2.8% |
Memory size | 3.7 KiB |
Length
Max length | 14 |
---|---|
Median length | 11 |
Mean length | 11.151448 |
Min length | 8 |
Characters and Unicode
Total characters | 5007 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 199 ? |
---|---|
Unique (%) | 44.3% |
Sample
1st row | 1.3x1.3x404 |
---|---|
2nd row | 1.5x2.7x1.4 |
3rd row | 2.0x2x2.72 |
4th row | 1.2x1.2x1.6 |
5th row | 1.2x1.2x4.0 |
Value | Count | Frequency (%) |
0.9x0.9x1.0 | 22 | 4.9% |
1.2x1.2x1.2 | 16 | 3.6% |
2.5x2.0x1.5 | 13 | 2.9% |
3.5x2.5x1.5 | 13 | 2.9% |
1.2x1.2x1.5 | 12 | 2.7% |
ø1200x1.2 | 10 | 2.2% |
1.2x1.2x1.6 | 7 | 1.6% |
1.3x1.5x0.63 | 7 | 1.6% |
1.2x2.0x1.2 | 6 | 1.3% |
2.0x2.5x1.5 | 6 | 1.3% |
Other values (252) | 337 |
Most occurring characters
Value | Count | Frequency (%) |
. | 1289 | |
1 | 890 | |
x | 866 | |
2 | 523 | |
0 | 416 | 8.3% |
5 | 404 | 8.1% |
3 | 226 | 4.5% |
9 | 89 | 1.8% |
4 | 80 | 1.6% |
6 | 73 | 1.5% |
Other values (4) | 151 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2820 | |
Other Punctuation | 1289 | |
Lowercase Letter | 866 | 17.3% |
Uppercase Letter | 32 | 0.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 890 | |
2 | 523 | |
0 | 416 | |
5 | 404 | |
3 | 226 | 8.0% |
9 | 89 | 3.2% |
4 | 80 | 2.8% |
6 | 73 | 2.6% |
8 | 63 | 2.2% |
7 | 56 | 2.0% |
Uppercase Letter
Value | Count | Frequency (%) |
Ø | 26 | |
X | 6 | 18.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1289 |
Lowercase Letter
Value | Count | Frequency (%) |
x | 866 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4109 | |
Latin | 898 | 17.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 1289 | |
1 | 890 | |
2 | 523 | |
0 | 416 | 10.1% |
5 | 404 | 9.8% |
3 | 226 | 5.5% |
9 | 89 | 2.2% |
4 | 80 | 1.9% |
6 | 73 | 1.8% |
8 | 63 | 1.5% |
Latin
Value | Count | Frequency (%) |
x | 866 | |
Ø | 26 | 2.9% |
X | 6 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4981 | |
None | 26 | 0.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 1289 | |
1 | 890 | |
x | 866 | |
2 | 523 | |
0 | 416 | 8.4% |
5 | 404 | 8.1% |
3 | 226 | 4.5% |
9 | 89 | 1.8% |
4 | 80 | 1.6% |
6 | 73 | 1.5% |
Other values (3) | 125 | 2.5% |
None
Value | Count | Frequency (%) |
Ø | 26 |
맨홀종류
Categorical
Distinct | 15 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.7 KiB |
SOM999 | |
---|---|
SOM040 | |
SOM012 | |
SOM903 | |
SOM914 | |
Other values (10) |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | SOM040 |
---|---|
2nd row | SOM040 |
3rd row | SOM040 |
4th row | SOM040 |
5th row | SOM040 |
Common Values
Value | Count | Frequency (%) |
SOM999 | 153 | |
SOM040 | 107 | |
SOM012 | 39 | 8.4% |
SOM903 | 37 | 8.0% |
SOM914 | 31 | 6.7% |
SOM915 | 25 | 5.4% |
SOM002 | 25 | 5.4% |
SOM000 | 13 | 2.8% |
SOM013 | 9 | 1.9% |
SOM015 | 7 | 1.5% |
Other values (5) | 16 | 3.5% |
Length
Value | Count | Frequency (%) |
som999 | 153 | |
som040 | 107 | |
som012 | 39 | 8.4% |
som903 | 37 | 8.0% |
som914 | 31 | 6.7% |
som915 | 25 | 5.4% |
som002 | 25 | 5.4% |
som000 | 13 | 2.8% |
som013 | 9 | 1.9% |
som015 | 7 | 1.5% |
Other values (5) | 16 | 3.5% |
맨홀형태
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.7 KiB |
MHS003 | |
---|---|
MHS001 | 26 |
MHS000 | 13 |
MHS005 | 9 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MHS005 |
---|---|
2nd row | MHS005 |
3rd row | MHS005 |
4th row | MHS005 |
5th row | MHS005 |
Common Values
Value | Count | Frequency (%) |
MHS003 | 414 | |
MHS001 | 26 | 5.6% |
MHS000 | 13 | 2.8% |
MHS005 | 9 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
mhs003 | 414 | |
mhs001 | 26 | 5.6% |
mhs000 | 13 | 2.8% |
mhs005 | 9 | 1.9% |
법정동
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6840279 × 109 |
Minimum | 4.684025 × 109 |
---|---|
Maximum | 4.684037 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 KiB |
Quantile statistics
Minimum | 4.684025 × 109 |
---|---|
5-th percentile | 4.684025 × 109 |
Q1 | 4.6840253 × 109 |
median | 4.6840253 × 109 |
Q3 | 4.684033 × 109 |
95-th percentile | 4.684035 × 109 |
Maximum | 4.684037 × 109 |
Range | 12000 |
Interquartile range (IQR) | 7700 |
Descriptive statistics
Standard deviation | 4113.1302 |
---|---|
Coefficient of variation (CV) | 8.7811821 × 10-7 |
Kurtosis | -0.66404703 |
Mean | 4.6840279 × 109 |
Median Absolute Deviation (MAD) | 300 |
Skewness | 1.0657536 |
Sum | 2.1640209 × 1012 |
Variance | 16917840 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4684025300 | 172 | |
4684025622 | 78 | |
4684025000 | 74 | |
4684034000 | 72 | |
4684037000 | 21 | 4.5% |
4684033000 | 15 | 3.2% |
4684032000 | 14 | 3.0% |
4684035000 | 8 | 1.7% |
4684025600 | 6 | 1.3% |
4684036000 | 2 | 0.4% |
Value | Count | Frequency (%) |
4684025000 | 74 | |
4684025300 | 172 | |
4684025600 | 6 | 1.3% |
4684025622 | 78 | |
4684032000 | 14 | 3.0% |
4684033000 | 15 | 3.2% |
4684034000 | 72 | |
4684035000 | 8 | 1.7% |
4684036000 | 2 | 0.4% |
4684037000 | 21 | 4.5% |
Value | Count | Frequency (%) |
4684037000 | 21 | 4.5% |
4684036000 | 2 | 0.4% |
4684035000 | 8 | 1.7% |
4684034000 | 72 | |
4684033000 | 15 | 3.2% |
4684032000 | 14 | 3.0% |
4684025622 | 78 | |
4684025600 | 6 | 1.3% |
4684025300 | 172 | |
4684025000 | 74 |
설치일자 | 맨홀종류 | 맨홀형태 | 법정동 | |
---|---|---|---|---|
설치일자 | 1.000 | 0.758 | 0.619 | 0.777 |
맨홀종류 | 0.758 | 1.000 | 0.331 | 0.440 |
맨홀형태 | 0.619 | 0.331 | 1.000 | 0.148 |
법정동 | 0.777 | 0.440 | 0.148 | 1.000 |
맨홀종류 | 맨홀형태 | |
---|---|---|
맨홀종류 | 1.000 | 0.191 |
맨홀형태 | 0.191 | 1.000 |
법정동 | 맨홀종류 | 맨홀형태 | |
---|---|---|---|
법정동 | 1.000 | 0.209 | 0.070 |
맨홀종류 | 0.209 | 1.000 | 0.191 |
맨홀형태 | 0.070 | 0.191 | 1.000 |
도엽번호 | 설치일자 | 규격 | 맨홀종류 | 맨홀형태 | 법정동 | |
---|---|---|---|---|---|---|
0 | 346020503A | 1900-01-01 | 1.3x1.3x404 | SOM040 | MHS005 | 4684025300 |
1 | 346020503D | 2007-01-01 | 1.5x2.7x1.4 | SOM040 | MHS005 | 4684025300 |
2 | 346020503C | 2007-01-01 | 2.0x2x2.72 | SOM040 | MHS005 | 4684025300 |
3 | 356142594D | 2010-01-01 | 1.2x1.2x1.6 | SOM040 | MHS005 | 4684025300 |
4 | 356142595C | 2010-01-01 | 1.2x1.2x4.0 | SOM040 | MHS005 | 4684025300 |
5 | 356142596D | 2010-01-01 | 1.2x1.5x3.0 | SOM040 | MHS005 | 4684025300 |
6 | 356142596D | 2010-01-01 | 1.2x1.2x3.0 | SOM040 | MHS005 | 4684025300 |
7 | 346020516A | 2007-01-01 | 1.4x0.9x4.2 | SOM040 | MHS005 | 4684025300 |
8 | 346020516A | 2007-01-01 | 0.9x0.9x4.2 | SOM040 | MHS005 | 4684025300 |
9 | 346020517A | 2007-01-01 | 2.5x2.8x1.9 | SOM040 | MHS003 | 4684025000 |
도엽번호 | 설치일자 | 규격 | 맨홀종류 | 맨홀형태 | 법정동 | |
---|---|---|---|---|---|---|
452 | 346022033B | 2007-01-01 | 1.7x2.4x1.7 | SOM999 | MHS000 | 4684025000 |
453 | 346021570B | 2007-01-01 | 1.5x1.5x1.9 | SOM999 | MHS000 | 4684034000 |
454 | 346021500C | 2007-01-01 | 1.5x1.5x1.3 | SOM999 | MHS000 | 4684034000 |
455 | 346031141A | 2007-01-01 | 1.3x1.3x2.4 | SOM999 | MHS000 | 4684034000 |
456 | 346031171C | 2007-01-01 | 1.3x1.3x2.4 | SOM999 | MHS000 | 4684034000 |
457 | 346021500D | 2007-01-01 | 1.3x1.3x2.4 | SOM999 | MHS000 | 4684034000 |
458 | 346021587D | 2007-01-01 | 1.2x1.2x2.0 | SOM002 | MHS000 | 4684025622 |
459 | 346021587A | 2007-01-01 | 1.2x1.2x1.0 | SOM002 | MHS000 | 4684025300 |
460 | 346022016C | 2007-01-01 | 1.0x1.5x0.9 | SOM999 | MHS000 | 4684025300 |
461 | 346022025B | 2007-01-01 | 1.0x1.5x0.9 | SOM999 | MHS000 | 4684033000 |
Most frequently occurring
도엽번호 | 설치일자 | 규격 | 맨홀종류 | 맨홀형태 | 법정동 | # duplicates | |
---|---|---|---|---|---|---|---|
12 | 346022088C | 2019-01-01 | 3.5x2.5x1.5 | SOM999 | MHS003 | 4684025300 | 4 |
2 | 346020526B | 1900-01-01 | 1.2x1.2x1.2 | SOM040 | MHS003 | 4684025300 | 3 |
14 | 346022096D | 2019-01-01 | 2.5x2.0x1.5 | SOM999 | MHS003 | 4684025300 | 3 |
0 | 346020503A | 2011-01-01 | 1.3x1.3x3.69 | SOM040 | MHS003 | 4684025000 | 2 |
1 | 346020526B | 1900-01-01 | 1.2x1.2x1.2 | SOM040 | MHS003 | 4684025000 | 2 |
3 | 346020527D | 1900-01-01 | 2.0x1.2x2.0 | SOM040 | MHS003 | 4684025000 | 2 |
4 | 346021440B | 2007-01-01 | 1.2x1.2x4.7 | SOM002 | MHS001 | 4684025000 | 2 |
5 | 346022072B | 2010-01-01 | 1.2x1.2x1.3 | SOM012 | MHS003 | 4684025622 | 2 |
6 | 346022082C | 2010-01-01 | 1.2x1.2x1.2 | SOM012 | MHS003 | 4684025622 | 2 |
7 | 346022087B | 2019-01-01 | 1.2x1.2x1.5 | SOM903 | MHS003 | 4684025300 | 2 |