Overview

Dataset statistics

Number of variables10
Number of observations36
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory85.6 B

Variable types

Categorical7
Numeric2
Text1

Dataset

Description한국지역난방공사의 지하매설물 현황에 대한 데이터로 구분, 관경, 관경단위, 타입, 기준, 배관길이, 배관단위, 비율, 비율단위에 대한 정보를 제공합니다.
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15002877/fileData.do

Alerts

기준일 has constant value ""Constant
관경단위 has constant value ""Constant
기준 has constant value ""Constant
배관단위 has constant value ""Constant
비율단위 has constant value ""Constant
관경 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 관경High correlation
배관길이 has 3 (8.3%) zerosZeros

Reproduction

Analysis started2024-03-14 09:06:04.268364
Analysis finished2024-03-14 09:06:06.468001
Duration2.2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size416.0 B
2023-12-31
36 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-31
2nd row2023-12-31
3rd row2023-12-31
4th row2023-12-31
5th row2023-12-31

Common Values

ValueCountFrequency (%)
2023-12-31 36
100.0%

Length

2024-03-14T18:06:06.676952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:06.990078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-31 36
100.0%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size416.0 B
분배관
21 
주수송관
15 

Length

Max length4
Median length3
Mean length3.4166667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주수송관
2nd row주수송관
3rd row주수송관
4th row주수송관
5th row주수송관

Common Values

ValueCountFrequency (%)
분배관 21
58.3%
주수송관 15
41.7%

Length

2024-03-14T18:06:07.321147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:07.640970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분배관 21
58.3%
주수송관 15
41.7%

관경
Real number (ℝ)

HIGH CORRELATION 

Distinct27
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean325.11111
Minimum20
Maximum1100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size452.0 B
2024-03-14T18:06:07.975343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile25
Q161.25
median150
Q3562.5
95-th percentile925
Maximum1100
Range1080
Interquartile range (IQR)501.25

Descriptive statistics

Standard deviation328.27678
Coefficient of variation (CV)1.0097372
Kurtosis-0.44773427
Mean325.11111
Median Absolute Deviation (MAD)121.5
Skewness0.92397155
Sum11704
Variance107765.64
MonotonicityNot monotonic
2024-03-14T18:06:08.359334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
25 2
 
5.6%
32 2
 
5.6%
40 2
 
5.6%
50 2
 
5.6%
65 2
 
5.6%
80 2
 
5.6%
100 2
 
5.6%
125 2
 
5.6%
150 2
 
5.6%
1100 1
 
2.8%
Other values (17) 17
47.2%
ValueCountFrequency (%)
20 1
2.8%
25 2
5.6%
32 2
5.6%
40 2
5.6%
50 2
5.6%
65 2
5.6%
80 2
5.6%
100 2
5.6%
125 2
5.6%
150 2
5.6%
ValueCountFrequency (%)
1100 1
2.8%
1000 1
2.8%
900 1
2.8%
850 1
2.8%
800 1
2.8%
750 1
2.8%
700 1
2.8%
650 1
2.8%
600 1
2.8%
550 1
2.8%

관경단위
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size416.0 B
A
36 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA
2nd rowA
3rd rowA
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
A 36
100.0%

Length

2024-03-14T18:06:08.762879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:09.074835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a 36
100.0%

타입
Categorical

Distinct2
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size416.0 B
single
27 
Twin

Length

Max length6
Median length6
Mean length5.5
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowsingle
2nd rowsingle
3rd rowsingle
4th rowsingle
5th rowsingle

Common Values

ValueCountFrequency (%)
single 27
75.0%
Twin 9
 
25.0%

Length

2024-03-14T18:06:09.431196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:09.783988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
single 27
75.0%
twin 9
 
25.0%

기준
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size416.0 B
DN
36 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDN
2nd rowDN
3rd rowDN
4th rowDN
5th rowDN

Common Values

ValueCountFrequency (%)
DN 36
100.0%

Length

2024-03-14T18:06:10.136219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:10.452839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
dn 36
100.0%

배관길이
Real number (ℝ)

ZEROS 

Distinct34
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135693.6
Minimum0
Maximum618335.38
Zeros3
Zeros (%)8.3%
Negative0
Negative (%)0.0%
Memory size452.0 B
2024-03-14T18:06:10.762327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13754.2713
median19183.725
Q3260754.51
95-th percentile508102.98
Maximum618335.38
Range618335.38
Interquartile range (IQR)257000.24

Descriptive statistics

Standard deviation181004.47
Coefficient of variation (CV)1.3339205
Kurtosis0.69083361
Mean135693.6
Median Absolute Deviation (MAD)19183.725
Skewness1.29879
Sum4884969.5
Variance3.2762617 × 1010
MonotonicityNot monotonic
2024-03-14T18:06:11.168169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
0.0 3
 
8.3%
4111.91 1
 
2.8%
4993.4 1
 
2.8%
111380.17 1
 
2.8%
95408.05 1
 
2.8%
34840.96 1
 
2.8%
11468.84 1
 
2.8%
19503.38 1
 
2.8%
12892.23 1
 
2.8%
3786.235 1
 
2.8%
Other values (24) 24
66.7%
ValueCountFrequency (%)
0.0 3
8.3%
227.52 1
 
2.8%
269.08 1
 
2.8%
602.195 1
 
2.8%
668.145 1
 
2.8%
2713.99 1
 
2.8%
3658.38 1
 
2.8%
3786.235 1
 
2.8%
4111.91 1
 
2.8%
4993.4 1
 
2.8%
ValueCountFrequency (%)
618335.38 1
2.8%
577118.43 1
2.8%
485097.835 1
2.8%
417214.83 1
2.8%
357633.52 1
2.8%
351706.25 1
2.8%
311025.16 1
2.8%
278269.21 1
2.8%
272967.53 1
2.8%
256683.5 1
2.8%

배관단위
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size416.0 B
m
36 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowm
2nd rowm
3rd rowm
4th rowm
5th rowm

Common Values

ValueCountFrequency (%)
m 36
100.0%

Length

2024-03-14T18:06:11.577378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:11.889094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 36
100.0%

비율
Text

Distinct30
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Memory size416.0 B
2024-03-14T18:06:12.554097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length5.0555556
Min length5

Characters and Unicode

Total characters182
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)75.0%

Sample

1st row0.08%
2nd row0.18%
3rd row0.39%
4th row2.71%
5th row2.88%
ValueCountFrequency (%)
0.00 4
 
11.1%
0.01 3
 
8.3%
0.08 2
 
5.6%
11.81 1
 
2.8%
2.71 1
 
2.8%
9.93 1
 
2.8%
0.39 1
 
2.8%
0.06 1
 
2.8%
0.10 1
 
2.8%
0.26 1
 
2.8%
Other values (20) 20
55.6%
2024-03-14T18:06:13.681698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 39
21.4%
. 36
19.8%
% 36
19.8%
1 14
 
7.7%
2 12
 
6.6%
8 10
 
5.5%
5 8
 
4.4%
7 7
 
3.8%
6 6
 
3.3%
9 6
 
3.3%
Other values (2) 8
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 110
60.4%
Other Punctuation 72
39.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 39
35.5%
1 14
 
12.7%
2 12
 
10.9%
8 10
 
9.1%
5 8
 
7.3%
7 7
 
6.4%
6 6
 
5.5%
9 6
 
5.5%
3 5
 
4.5%
4 3
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 36
50.0%
% 36
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 182
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 39
21.4%
. 36
19.8%
% 36
19.8%
1 14
 
7.7%
2 12
 
6.6%
8 10
 
5.5%
5 8
 
4.4%
7 7
 
3.8%
6 6
 
3.3%
9 6
 
3.3%
Other values (2) 8
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 182
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 39
21.4%
. 36
19.8%
% 36
19.8%
1 14
 
7.7%
2 12
 
6.6%
8 10
 
5.5%
5 8
 
4.4%
7 7
 
3.8%
6 6
 
3.3%
9 6
 
3.3%
Other values (2) 8
 
4.4%

비율단위
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size416.0 B
%
36 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row%
2nd row%
3rd row%
4th row%
5th row%

Common Values

ValueCountFrequency (%)
% 36
100.0%

Length

2024-03-14T18:06:14.099078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:06:14.410600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
36
100.0%

Interactions

2024-03-14T18:06:05.020761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T18:06:04.538328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T18:06:05.262285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T18:06:04.772163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T18:06:14.540449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분관경타입배관길이비율
구분1.0000.9980.5800.5320.951
관경0.9981.0000.2320.6480.994
타입0.5800.2321.0000.2390.913
배관길이0.5320.6480.2391.0001.000
비율0.9510.9940.9131.0001.000
2024-03-14T18:06:14.700248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분타입
구분1.0000.393
타입0.3931.000
2024-03-14T18:06:14.840935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관경배관길이구분타입
관경1.0000.3830.8400.131
배관길이0.3831.0000.4710.198
구분0.8400.4711.0000.393
타입0.1310.1980.3931.000

Missing values

2024-03-14T18:06:05.820580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T18:06:06.292979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준일구분관경관경단위타입기준배관길이배관단위비율비율단위
02023-12-31주수송관1100AsingleDN4111.91m0.08%%
12023-12-31주수송관1000AsingleDN8665.35m0.18%%
22023-12-31주수송관900AsingleDN18864.07m0.39%%
32023-12-31주수송관850AsingleDN132441.32m2.71%%
42023-12-31주수송관800AsingleDN140477.82m2.88%%
52023-12-31주수송관750AsingleDN13619.67m0.28%%
62023-12-31주수송관700AsingleDN357633.52m7.32%%
72023-12-31주수송관650AsingleDN3658.38m0.07%%
82023-12-31주수송관600AsingleDN311025.16m6.37%%
92023-12-31주수송관550AsingleDN9256.64m0.19%%
기준일구분관경관경단위타입기준배관길이배관단위비율비율단위
262023-12-31분배관20AsingleDN4993.4m0.10%%
272023-12-31분배관150ATwinDN3786.235m0.08%%
282023-12-31분배관125ATwinDN2713.99m0.06%%
292023-12-31분배관100ATwinDN602.195m0.01%%
302023-12-31분배관80ATwinDN668.145m0.01%%
312023-12-31분배관65ATwinDN0.0m0.00%%
322023-12-31분배관50ATwinDN227.52m0.00%%
332023-12-31분배관40ATwinDN269.08m0.01%%
342023-12-31분배관32ATwinDN0.0m0.00%%
352023-12-31분배관25ATwinDN0.0m0.00%%