Overview

Dataset statistics

Number of variables4
Number of observations58
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory37.2 B

Variable types

Categorical1
Text1
Numeric2

Dataset

Description굴착공사로부터 지하에 매설된 집단에너지 열수송관의 파손사고를 예방하기 위해집단에너지 사업자와 굴착공사정보(집단에너지사업자에게)와 굴착지역 인근 열수송관 매설여부(굴착공사자에게)를 제공
Author한국에너지공단
URLhttps://www.data.go.kr/data/15086265/fileData.do

Alerts

구분 has constant value ""Constant
굴착공사 신고접수(건) is highly overall correlated with 사업자 접수처리(건)High correlation
사업자 접수처리(건) is highly overall correlated with 굴착공사 신고접수(건)High correlation
집단에너지사업자명 has unique valuesUnique
사업자 접수처리(건) has 12 (20.7%) zerosZeros

Reproduction

Analysis started2024-03-14 10:49:46.168506
Analysis finished2024-03-14 10:49:47.753266
Duration1.58 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size592.0 B
2023
58 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 58
100.0%

Length

2024-03-14T19:49:47.863460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T19:49:48.021985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 58
100.0%
Distinct58
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size592.0 B
2024-03-14T19:49:48.825396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length7.4310345
Min length4

Characters and Unicode

Total characters431
Distinct characters120
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row서울에너지공사
2nd row부산광역시
3rd row한국지역난방공사
4th row한국토지주택공사
5th rowGS파워(주)
ValueCountFrequency (%)
서울에너지공사 1
 
1.6%
무림파워텍㈜ 1
 
1.6%
주)석문에너지 1
 
1.6%
oci 1
 
1.6%
se㈜ 1
 
1.6%
㈜지에스이앤알 1
 
1.6%
코어엔텍 1
 
1.6%
sk에너지㈜ 1
 
1.6%
sgc에너지(주 1
 
1.6%
주)에팩 1
 
1.6%
Other values (51) 51
83.6%
2024-03-14T19:49:49.905670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
7.9%
32
 
7.4%
29
 
6.7%
28
 
6.5%
( 27
 
6.3%
) 27
 
6.3%
17
 
3.9%
12
 
2.8%
S 7
 
1.6%
7
 
1.6%
Other values (110) 211
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 333
77.3%
Open Punctuation 27
 
6.3%
Close Punctuation 27
 
6.3%
Uppercase Letter 22
 
5.1%
Other Symbol 17
 
3.9%
Space Separator 3
 
0.7%
Dash Punctuation 1
 
0.2%
Lowercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
10.2%
32
 
9.6%
29
 
8.7%
28
 
8.4%
12
 
3.6%
7
 
2.1%
7
 
2.1%
6
 
1.8%
6
 
1.8%
6
 
1.8%
Other values (95) 166
49.8%
Uppercase Letter
ValueCountFrequency (%)
S 7
31.8%
O 2
 
9.1%
C 2
 
9.1%
M 2
 
9.1%
I 2
 
9.1%
K 2
 
9.1%
L 2
 
9.1%
G 2
 
9.1%
E 1
 
4.5%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Other Symbol
ValueCountFrequency (%)
17
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 350
81.2%
Common 58
 
13.5%
Latin 23
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
9.7%
32
 
9.1%
29
 
8.3%
28
 
8.0%
17
 
4.9%
12
 
3.4%
7
 
2.0%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (96) 172
49.1%
Latin
ValueCountFrequency (%)
S 7
30.4%
O 2
 
8.7%
C 2
 
8.7%
M 2
 
8.7%
I 2
 
8.7%
K 2
 
8.7%
L 2
 
8.7%
G 2
 
8.7%
E 1
 
4.3%
n 1
 
4.3%
Common
ValueCountFrequency (%)
( 27
46.6%
) 27
46.6%
3
 
5.2%
- 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 333
77.3%
ASCII 81
 
18.8%
None 17
 
3.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
10.2%
32
 
9.6%
29
 
8.7%
28
 
8.4%
12
 
3.6%
7
 
2.1%
7
 
2.1%
6
 
1.8%
6
 
1.8%
6
 
1.8%
Other values (95) 166
49.8%
ASCII
ValueCountFrequency (%)
( 27
33.3%
) 27
33.3%
S 7
 
8.6%
3
 
3.7%
O 2
 
2.5%
C 2
 
2.5%
M 2
 
2.5%
I 2
 
2.5%
K 2
 
2.5%
L 2
 
2.5%
Other values (4) 5
 
6.2%
None
ValueCountFrequency (%)
17
100.0%

굴착공사 신고접수(건)
Real number (ℝ)

HIGH CORRELATION 

Distinct56
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1402.1034
Minimum2
Maximum28086
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size650.0 B
2024-03-14T19:49:50.136818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile25.15
Q198.25
median321.5
Q3877
95-th percentile4833.55
Maximum28086
Range28084
Interquartile range (IQR)778.75

Descriptive statistics

Standard deviation3986.3338
Coefficient of variation (CV)2.8431096
Kurtosis36.579031
Mean1402.1034
Median Absolute Deviation (MAD)283.5
Skewness5.7058503
Sum81322
Variance15890857
MonotonicityNot monotonic
2024-03-14T19:49:50.400987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
58 2
 
3.4%
102 2
 
3.4%
6684 1
 
1.7%
28 1
 
1.7%
97 1
 
1.7%
80 1
 
1.7%
850 1
 
1.7%
616 1
 
1.7%
141 1
 
1.7%
45 1
 
1.7%
Other values (46) 46
79.3%
ValueCountFrequency (%)
2 1
1.7%
6 1
1.7%
9 1
1.7%
28 1
1.7%
33 1
1.7%
37 1
1.7%
45 1
1.7%
50 1
1.7%
56 1
1.7%
58 2
3.4%
ValueCountFrequency (%)
28086 1
1.7%
10496 1
1.7%
6684 1
1.7%
4507 1
1.7%
4260 1
1.7%
3652 1
1.7%
2286 1
1.7%
1889 1
1.7%
1396 1
1.7%
1370 1
1.7%

사업자 접수처리(건)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct45
Distinct (%)77.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1260.8621
Minimum0
Maximum25832
Zeros12
Zeros (%)20.7%
Negative0
Negative (%)0.0%
Memory size650.0 B
2024-03-14T19:49:50.806921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median197.5
Q3829.75
95-th percentile4744.75
Maximum25832
Range25832
Interquartile range (IQR)827.75

Descriptive statistics

Standard deviation3719.2459
Coefficient of variation (CV)2.9497643
Kurtosis34.746702
Mean1260.8621
Median Absolute Deviation (MAD)197.5
Skewness5.5499135
Sum73130
Variance13832790
MonotonicityNot monotonic
2024-03-14T19:49:51.142870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
0 12
 
20.7%
1 2
 
3.4%
2 2
 
3.4%
6449 1
 
1.7%
141 1
 
1.7%
790 1
 
1.7%
843 1
 
1.7%
36 1
 
1.7%
881 1
 
1.7%
408 1
 
1.7%
Other values (35) 35
60.3%
ValueCountFrequency (%)
0 12
20.7%
1 2
 
3.4%
2 2
 
3.4%
3 1
 
1.7%
22 1
 
1.7%
36 1
 
1.7%
67 1
 
1.7%
72 1
 
1.7%
91 1
 
1.7%
92 1
 
1.7%
ValueCountFrequency (%)
25832 1
1.7%
10300 1
1.7%
6449 1
1.7%
4444 1
1.7%
4244 1
1.7%
2692 1
1.7%
2281 1
1.7%
1879 1
1.7%
1390 1
1.7%
1364 1
1.7%

Interactions

2024-03-14T19:49:46.820544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T19:49:46.340067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T19:49:47.061034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T19:49:46.579634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T19:49:51.293370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집단에너지사업자명굴착공사 신고접수(건)사업자 접수처리(건)
집단에너지사업자명1.0001.0001.000
굴착공사 신고접수(건)1.0001.0001.000
사업자 접수처리(건)1.0001.0001.000
2024-03-14T19:49:51.444430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
굴착공사 신고접수(건)사업자 접수처리(건)
굴착공사 신고접수(건)1.0000.918
사업자 접수처리(건)0.9181.000

Missing values

2024-03-14T19:49:47.387140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T19:49:47.671544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분집단에너지사업자명굴착공사 신고접수(건)사업자 접수처리(건)
02023서울에너지공사66846449
12023부산광역시326325
22023한국지역난방공사2808625832
32023한국토지주택공사6430
42023GS파워(주)1049610300
52023안산도시개발18891879
62023인천공항에너지(주)5161
72023위드인천에너지(주)36522692
82023인천종합에너지(주)42604244
92023나래에너지서비스(주)45074444
구분집단에너지사업자명굴착공사 신고접수(건)사업자 접수처리(건)
482023에코비트에너지580
492023(주)코엔텍90
502023영남에너지서비스198190
512023(주)한주607604
522023한화에너지㈜18492
532023SK멀티유틸리티640
542023성림에너지㈜317242
552023울산광역시330
562023포승그린파워㈜563
572023(주)지에스포천그린에너지22