Overview

Dataset statistics

Number of variables: 14
Number of observations: 192
Missing cells: 1834
Missing cells (%): 68.2%
Duplicate rows: 5
Duplicate rows (%): 2.6%
Total size in memory: 21.1 KiB
Average record size in memory: 112.7 B
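
The two percentages follow directly from the counts. A minimal sketch of the arithmetic, assuming the missing-cell share is taken over all cells (variables × observations) and the duplicate share over all rows; the variable names are illustrative only:

```python
# Figures copied from the dataset statistics in this report.
n_variables = 14
n_observations = 192
missing_cells = 1834
duplicate_rows = 5

# Assumption: every cell of the 14 x 192 grid counts toward the denominator.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Both rounded values agree with the figures reported here (68.2% and 2.6%).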

Variable types

Text: 2
Categorical: 1
Unsupported: 11

Dataset

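Description: Installation status of lighting managed by the Daejeon Metropolitan City Park Management Office (split into an overall summary and per-park figures). Fields include power consumption, among others.
Author: Daejeon Metropolitan City (대전광역시)
URL: https://www.data.go.kr/data/15077439/fileData.do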

Alerts

Dataset has 5 (2.6%) duplicate rows [Duplicates]
1. 공원관리사업소 가로등, 보안등, 공원등 설치현황 has 154 (80.2%) missing values [Missing]
Unnamed: 1 has 190 (99.0%) missing values [Missing]
Unnamed: 3 has 5 (2.6%) missing values [Missing]
Unnamed: 4 has 145 (75.5%) missing values [Missing]
Unnamed: 5 has 146 (76.0%) missing values [Missing]
Unnamed: 6 has 147 (76.6%) missing values [Missing]
Unnamed: 7 has 142 (74.0%) missing values [Missing]
Unnamed: 8 has 127 (66.1%) missing values [Missing]
Unnamed: 9 has 145 (75.5%) missing values [Missing]
Unnamed: 10 has 150 (78.1%) missing values [Missing]
Unnamed: 11 has 145 (75.5%) missing values [Missing]
Unnamed: 12 has 148 (77.1%) missing values [Missing]
Unnamed: 13 has 190 (99.0%) missing values [Missing]
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2023-12-12 20:49:25.992323
Analysis finished: 2023-12-12 20:49:26.967340
Duration: 0.98 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

1. 공원관리사업소 가로등, 보안등, 공원등 설치현황
Text

Distinct: 38
Distinct (%): 100.0%
Missing: 154
Missing (%): 80.2%
Memory size: 1.6 KiB

Length

Max length: 38
Median length: 18
Mean length: 12.473684
Min length: 2

Characters and Unicode

Total characters: 474
Distinct characters: 125
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
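
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own code; the helper `category_counts` and the label mapping are names made up for this example, and the input string is the third sample row of this variable.

```python
import unicodedata
from collections import Counter

# Human-readable labels for the few general-category codes used below.
# The codes come from the Unicode Standard; this dict is only a convenience.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
}

def category_counts(text: str) -> Counter:
    """Count characters per Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "경익운수 ~ 느티나무구간"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(f"{CATEGORY_LABELS.get(code, code)}: {n}")
```

Hangul syllables fall under "Other Letter", which is why that category dominates the counts for this variable, while the "~" separator is classified as a "Math Symbol".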

Unique

Unique: 38
Unique (%): 100.0%

Sample

1st row: 구분
2nd row: 공원관리사업소 전체
3rd row: 경익운수 ~ 느티나무구간
4th row: 보훈공원삼거리 ~ 보훈공원입구
5th row: 문화농장 ~ 까치약수터 ~ 까치탑
Value | Count | Frequency (%)
(value not recovered) | 24 | 23.8%
청년광장 | 5 | 5.0%
숲속공연장 | 3 | 3.0%
주차장 | 3 | 3.0%
인라인스케이트장위 | 2 | 2.0%
배드민턴장 | 2 | 2.0%
문화배수지 | 2 | 2.0%
송학사 | 2 | 2.0%
망향탑 | 2 | 2.0%
보석천약수터 | 2 | 2.0%
Other values (51) | 54 | 53.5%

Most occurring characters

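Value | Count | Frequency (%)
(space) | 81 | 17.1%
(character not recovered) | 26 | 5.5%
~ | 25 | 5.3%
(character not recovered) | 16 | 3.4%
(character not recovered) | 10 | 2.1%
(character not recovered) | 10 | 2.1%
(character not recovered) | 9 | 1.9%
(character not recovered) | 9 | 1.9%
(character not recovered) | 8 | 1.7%
(character not recovered) | 8 | 1.7%
Other values (115) | 272 | 57.4%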

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 354 | 74.7%
Space Separator | 81 | 17.1%
Math Symbol | 25 | 5.3%
Open Punctuation | 4 | 0.8%
Close Punctuation | 4 | 0.8%
Decimal Number | 4 | 0.8%
Dash Punctuation | 1 | 0.2%
Other Punctuation | 1 | 0.2%

Most frequent character per category

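Other Letter
Value | Count | Frequency (%)
(character not recovered) | 26 | 7.3%
(character not recovered) | 16 | 4.5%
(character not recovered) | 10 | 2.8%
(character not recovered) | 10 | 2.8%
(character not recovered) | 9 | 2.5%
(character not recovered) | 9 | 2.5%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
Other values (105) | 242 | 68.4%

Decimal Number
Value | Count | Frequency (%)
3 | 1 | 25.0%
4 | 1 | 25.0%
1 | 1 | 25.0%
2 | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 81 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 25 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%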

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 354 | 74.7%
Common | 120 | 25.3%

Most frequent character per script

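Hangul
Value | Count | Frequency (%)
(character not recovered) | 26 | 7.3%
(character not recovered) | 16 | 4.5%
(character not recovered) | 10 | 2.8%
(character not recovered) | 10 | 2.8%
(character not recovered) | 9 | 2.5%
(character not recovered) | 9 | 2.5%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
Other values (105) | 242 | 68.4%

Common
Value | Count | Frequency (%)
(space) | 81 | 67.5%
~ | 25 | 20.8%
( | 4 | 3.3%
) | 4 | 3.3%
3 | 1 | 0.8%
4 | 1 | 0.8%
- | 1 | 0.8%
1 | 1 | 0.8%
, | 1 | 0.8%
2 | 1 | 0.8%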

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 354 | 74.7%
ASCII | 120 | 25.3%

Most frequent character per block

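ASCII
Value | Count | Frequency (%)
(space) | 81 | 67.5%
~ | 25 | 20.8%
( | 4 | 3.3%
) | 4 | 3.3%
3 | 1 | 0.8%
4 | 1 | 0.8%
- | 1 | 0.8%
1 | 1 | 0.8%
, | 1 | 0.8%
2 | 1 | 0.8%

Hangul
Value | Count | Frequency (%)
(character not recovered) | 26 | 7.3%
(character not recovered) | 16 | 4.5%
(character not recovered) | 10 | 2.8%
(character not recovered) | 10 | 2.8%
(character not recovered) | 9 | 2.5%
(character not recovered) | 9 | 2.5%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
(character not recovered) | 8 | 2.3%
Other values (105) | 242 | 68.4%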

Unnamed: 1
Text

MISSING 

Distinct: 2
Distinct (%): 100.0%
Missing: 190
Missing (%): 99.0%
Memory size: 1.6 KiB

Length

Max length: 42
Median length: 23
Mean length: 23
Min length: 4

Characters and Unicode

Total characters: 46
Distinct characters: 25
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): 100.0%

Sample

1st row: 계약전력
2nd row: 공원등 : LED등 1527/ 총 등수2177 * 100 = 70.14%
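
The second sample row is a note embedded in the sheet that states the LED share of park lights. A quick check of that arithmetic, using only the two counts quoted in the note (the variable names are illustrative):

```python
# Counts quoted in the note: LED lamps and total lamps among park lights.
led_lamps = 1527
total_lamps = 2177

# Share of LED lamps, expressed as a percentage.
led_share_pct = led_lamps / total_lamps * 100

print(f"LED share: {led_share_pct:.2f}%")
```

The result rounds to the 70.14% given in the note.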
Value | Count | Frequency (%)
(value not recovered) | 3 | 27.3%
계약전력 | 1 | 9.1%
공원등 | 1 | 9.1%
led등 | 1 | 9.1%
1527 | 1 | 9.1%
(value not recovered) | 1 | 9.1%
등수2177 | 1 | 9.1%
100 | 1 | 9.1%
70.14 | 1 | 9.1%

Most occurring characters

Value | Count | Frequency (%)
(space) | 11 | 23.9%
1 | 4 | 8.7%
7 | 4 | 8.7%
0 | 3 | 6.5%
(character not recovered) | 3 | 6.5%
2 | 2 | 4.3%
4 | 1 | 2.2%
. | 1 | 2.2%
= | 1 | 2.2%
* | 1 | 2.2%
Other values (15) | 15 | 32.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 15 | 32.6%
Space Separator | 11 | 23.9%
Other Letter | 11 | 23.9%
Other Punctuation | 5 | 10.9%
Uppercase Letter | 3 | 6.5%
Math Symbol | 1 | 2.2%

Most frequent character per category

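Other Letter
Value | Count | Frequency (%)
(character not recovered) | 3 | 27.3%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%

Decimal Number
Value | Count | Frequency (%)
1 | 4 | 26.7%
7 | 4 | 26.7%
0 | 3 | 20.0%
2 | 2 | 13.3%
4 | 1 | 6.7%
5 | 1 | 6.7%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 20.0%
* | 1 | 20.0%
/ | 1 | 20.0%
: | 1 | 20.0%
% | 1 | 20.0%

Uppercase Letter
Value | Count | Frequency (%)
D | 1 | 33.3%
E | 1 | 33.3%
L | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 11 | 100.0%

Math Symbol
Value | Count | Frequency (%)
= | 1 | 100.0%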

Most occurring scripts

Value | Count | Frequency (%)
Common | 32 | 69.6%
Hangul | 11 | 23.9%
Latin | 3 | 6.5%

Most frequent character per script

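Common
Value | Count | Frequency (%)
(space) | 11 | 34.4%
1 | 4 | 12.5%
7 | 4 | 12.5%
0 | 3 | 9.4%
2 | 2 | 6.2%
4 | 1 | 3.1%
. | 1 | 3.1%
= | 1 | 3.1%
* | 1 | 3.1%
/ | 1 | 3.1%
Other values (3) | 3 | 9.4%

Hangul
Value | Count | Frequency (%)
(character not recovered) | 3 | 27.3%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%

Latin
Value | Count | Frequency (%)
D | 1 | 33.3%
E | 1 | 33.3%
L | 1 | 33.3%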

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 35 | 76.1%
Hangul | 11 | 23.9%

Most frequent character per block

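ASCII
Value | Count | Frequency (%)
(space) | 11 | 31.4%
1 | 4 | 11.4%
7 | 4 | 11.4%
0 | 3 | 8.6%
2 | 2 | 5.7%
4 | 1 | 2.9%
. | 1 | 2.9%
= | 1 | 2.9%
* | 1 | 2.9%
/ | 1 | 2.9%
Other values (6) | 6 | 17.1%

Hangul
Value | Count | Frequency (%)
(character not recovered) | 3 | 27.3%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%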

Unnamed: 2
Categorical

Distinct: 7
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
Top categories: 합계 (37), 나트륨 (37), 메탈 (37), LED (37), 기타 (37), other values (2)

Length

Max length: 4
Median length: 3
Mean length: 2.640625
Min length: 2

Unique

Unique: 1
Unique (%): 0.5%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 램프
4th row: <NA>
5th row: 합계

Common Values

Value | Count | Frequency (%)
합계 | 37 | 19.3%
나트륨 | 37 | 19.3%
메탈 | 37 | 19.3%
LED | 37 | 19.3%
기타 | 37 | 19.3%
<NA> | 6 | 3.1%
램프 | 1 | 0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of the common values: 합계, 나트륨, 메탈, LED and 기타 at 37 (19.3%) each, <NA> at 6 (3.1%), 램프 at 1 (0.5%)]

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 5
Missing (%): 2.6%
Memory size: 1.6 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 145
Missing (%): 75.5%
Memory size: 1.6 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 146
Missing (%): 76.0%
Memory size: 1.6 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 147
Missing (%): 76.6%
Memory size: 1.6 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 142
Missing (%): 74.0%
Memory size: 1.6 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 127
Missing (%): 66.1%
Memory size: 1.6 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 145
Missing (%): 75.5%
Memory size: 1.6 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 150
Missing (%): 78.1%
Memory size: 1.6 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 145
Missing (%): 75.5%
Memory size: 1.6 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 148
Missing (%): 77.1%
Memory size: 1.6 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 190
Missing (%): 99.0%
Memory size: 1.6 KiB

Correlations

Variable | 1. 공원관리사업소 가로등, 보안등, 공원등 설치현황 | Unnamed: 1 | Unnamed: 2
1. 공원관리사업소 가로등, 보안등, 공원등 설치현황 | 1.000 | NaN | 1.000
Unnamed: 1 | NaN | 1.000 | NaN
Unnamed: 2 | 1.000 | NaN | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
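
One common way to compute such a nullity correlation is the Pearson correlation between per-column presence indicators (1 where a value is present, 0 where it is missing). The sketch below assumes that definition and uses toy columns; `nullity_correlation` is a hypothetical helper, not a function of the profiling library.

```python
from math import isnan, sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation of presence indicators (1 = present, 0 = missing).

    Missing values are represented by None. Returns NaN when either column
    is entirely present or entirely missing, since its indicator is constant.
    """
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")
    return cov / sqrt(var_a * var_b)

# Toy columns: the first two are missing in the same rows,
# the third is present exactly where the first is missing.
together = nullity_correlation([None, "x", None, "y"], [None, 1, None, 2])
opposite = nullity_correlation([None, "x", None, "y"], [1, None, 2, None])
constant = nullity_correlation([None, "x", None, "y"], [1, 2, 3, 4])

print(together, opposite, constant)
```

A value near 1 means two columns tend to be filled in the same rows, a value near -1 means one is filled where the other is empty, and a fully populated column yields no defined correlation.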

Sample

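First rows

Row | 1. 공원관리사업소 가로등, 보안등, 공원등 설치현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13
0 | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
1 | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | (단위 : 개소) | NaN
2 | 구분 | 계약전력 | 램프 | 소비전력 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
3 | <NA> | <NA> | <NA> | 합계 | 15~50W | 51~60W | 61~70W | 71~80W | 91~100W | 125~150W | 151~175W | 201~250W | 1000W초과 | 기타
4 | 공원관리사업소 전체 | <NA> | 합계 | 1701 | 591 | 45 | 67 | 331 | 331 | 115 | 88 | 89 | 44 | 0.507349
5 | <NA> | <NA> | 나트륨 | 267 | 0 | 0 | 0 | 136 | 48 | 0 | 44 | 39 | 0 | NaN
6 | <NA> | <NA> | 메탈 | 68 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 | 44 | NaN
7 | <NA> | <NA> | LED | 863 | 447 | 45 | 6 | 11 | 231 | 53 | 44 | 26 | 0 | NaN
8 | <NA> | <NA> | 기타 | 503 | 144 | 0 | 61 | 184 | 52 | 62 | 0 | 0 | 0 | NaN
9 | 경익운수 ~ 느티나무구간 | <NA> | 합계 | 14 | 0 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | NaN

Last rows

Row | 1. 공원관리사업소 가로등, 보안등, 공원등 설치현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13
182 | <NA> | <NA> | LED | 6 | NaN | NaN | 6 | NaN | NaN | NaN | NaN | NaN | NaN | NaN
183 | <NA> | <NA> | 기타 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
184 | 치유의숲 | <NA> | 합계 | 31 | 31 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN
185 | <NA> | <NA> | 나트륨 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
186 | <NA> | <NA> | 메탈 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
187 | <NA> | <NA> | LED | 31 | 31 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
188 | <NA> | <NA> | 기타 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
189 | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
190 | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
191 | <NA> | 공원등 : LED등 1527/ 총 등수2177 * 100 = 70.14% | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN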
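
The sample suggests why the first column is about 80% missing: the location name appears only on each group's 합계 (total) row, and the four lamp-type rows beneath it leave the cell empty. A minimal cleaning sketch, assuming that layout, forward-fills the location and then checks that the lamp-type totals add up to the 합계 row. The totals are read from rows 4 to 9 of the sample, taking the fused digits of rows 4 to 8 as 1701, 267, 68, 863 and 503 (an assumption about where the cell boundaries fall); `forward_fill` is an illustrative helper.

```python
# (location, lamp type, total) for rows 4-9 of the sample; None marks <NA>.
rows = [
    ("공원관리사업소 전체", "합계", 1701),
    (None, "나트륨", 267),
    (None, "메탈", 68),
    (None, "LED", 863),
    (None, "기타", 503),
    ("경익운수 ~ 느티나무구간", "합계", 14),
]

def forward_fill(values):
    """Replace each None with the most recent non-None value before it."""
    filled, last = [], None
    for value in values:
        if value is not None:
            last = value
        filled.append(last)
    return filled

locations = forward_fill([row[0] for row in rows])
tidy = [(loc, lamp, total) for loc, (_, lamp, total) in zip(locations, rows)]

# Consistency check for the first group: lamp-type rows should sum to 합계.
first_group = [t for t in tidy if t[0] == "공원관리사업소 전체"]
reported_total = next(total for _, lamp, total in first_group if lamp == "합계")
summed_total = sum(total for _, lamp, total in first_group if lamp != "합계")

print(reported_total, summed_total)
```

After the fill, every row carries its location, so the column would no longer register as mostly missing, and the matching totals give some confidence that the rows were grouped correctly.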

Duplicate rows

Most frequently occurring

Row | 1. 공원관리사업소 가로등, 보안등, 공원등 설치현황 | Unnamed: 1 | Unnamed: 2 | # duplicates
0 | <NA> | <NA> | LED | 37
1 | <NA> | <NA> | 기타 | 37
2 | <NA> | <NA> | 나트륨 | 37
3 | <NA> | <NA> | 메탈 | 37
4 | <NA> | <NA> | <NA> | 5
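
The table lists only the first three columns, which suggests that rows were compared on those columns alone; that reading is an assumption. Under it, counting repeated rows can be sketched with `collections.Counter` on a few toy rows shaped like the ones in this report:

```python
from collections import Counter

# Toy rows restricted to the three columns shown in the table; None marks <NA>.
rows = [
    (None, None, "LED"),
    (None, None, "기타"),
    ("치유의숲", None, "합계"),
    (None, None, "LED"),
    (None, None, None),
    (None, None, None),
]

# Count identical rows and keep only those that occur more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, n)
```

Rows that differ only in the numeric columns collapse into one key here, which would explain why each lamp type shows up as a frequently repeated row.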