Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 566.4 KiB
Average record size in memory: 58.0 B
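
The average record size can be checked against the other two figures: it is the total memory footprint divided by the number of observations. The sketch below redoes that arithmetic, assuming 1 KiB = 1024 bytes; the function name is illustrative and not part of the profiling tool.

```python
def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the mean number of bytes per record for a table of total_kib KiB."""
    return total_kib * 1024 / n_rows


# Figures taken from the dataset statistics above.
size = average_record_size(566.4, 10000)
print(f"{size:.1f} B")
```

Rounded to one decimal place, the result agrees with the 58.0 B reported above.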

Variable types

Categorical: 5
Text: 1

Dataset

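Description: Nationwide utility pole computerized numbers (pole numbers) [전국 전주전산화번호(전주번호)]
Author: Korea Electric Power Corporation (한국전력공사)
URL: https://www.data.go.kr/data/15069417/fileData.do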

Alerts

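1차순번 (1st-level sequence number) has constant value "7": Constant
1차본부 (1st-level headquarters) has constant value "충북본부" (Chungbuk Headquarters): Constant
2차사업소 (2nd-level office) is highly overall correlated with 2차순번 (2nd-level sequence number): High correlation
2차순번 (2nd-level sequence number) is highly overall correlated with 2차사업소 (2nd-level office): High correlation
전산화번호 (computerized number) has unique values: Unique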
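
The Constant and Unique alerts can be illustrated with a simple rule: a column is constant when it holds one distinct value, and unique when every row holds a different value. The following is a minimal sketch of that rule, not the profiling tool's own implementation; the function name is hypothetical and the five rows are copied from the sample rows shown in this report.

```python
from collections import Counter


def column_alerts(rows, columns):
    """Flag columns that hold a single value (Constant) or all-distinct values (Unique)."""
    alerts = {}
    for i, name in enumerate(columns):
        counts = Counter(row[i] for row in rows)
        if len(counts) == 1:
            alerts[name] = "Constant"
        elif len(counts) == len(rows):
            alerts[name] = "Unique"
    return alerts


columns = ["1차순번", "1차본부", "2차순번", "2차사업소", "전산화번호", "지역구분"]
rows = [
    ("7", "충북본부", "1", "충북본부직할", "1872D031", "<NA>"),
    ("7", "충북본부", "2", "동청주지사", "2179P181", "<NA>"),
    ("7", "충북본부", "1", "충북본부직할", "2074F271", "주택가"),
    ("7", "충북본부", "1", "충북본부직할", "2172P281", "주택가"),
    ("7", "충북본부", "2", "동청주지사", "2178F871", "주택가"),
]
print(column_alerts(rows, columns))
```

On these five rows the sketch flags 1차순번 and 1차본부 as Constant and 전산화번호 as Unique, matching the alerts listed above for the full 10000 rows.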

Reproduction

Analysis started: 2023-12-12 15:22:39.091871
Analysis finished: 2023-12-12 15:22:39.661711
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
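
The reported duration is the difference between the two timestamps. A short check using only the Python standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps as reported in the Reproduction section.
started = datetime.fromisoformat("2023-12-12 15:22:39.091871")
finished = datetime.fromisoformat("2023-12-12 15:22:39.661711")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```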

Variables

1차순번 (1st-level sequence number)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
7 | 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 7
2nd row: 7
3rd row: 7
4th row: 7
5th row: 7

Common Values

Value | Count | Frequency (%)
7 | 10000 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; the counts match the Common Values table]

1차본부 (1st-level headquarters)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
충북본부 (Chungbuk Headquarters) | 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충북본부
2nd row: 충북본부
3rd row: 충북본부
4th row: 충북본부
5th row: 충북본부

Common Values

Value | Count | Frequency (%)
충북본부 (Chungbuk Headquarters) | 10000 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; the counts match the Common Values table]

2차순번 (2nd-level sequence number)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
1 | 5817
2 | 4183

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 2
3rd row: 1
4th row: 1
5th row: 2

Common Values

Value | Count | Frequency (%)
1 | 5817 | 58.2%
2 | 4183 | 41.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; the counts match the Common Values table]

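2차사업소 (2nd-level office)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
충북본부직할 (Chungbuk Headquarters, direct jurisdiction) | 5817
동청주지사 (East Cheongju Branch) | 4183

Length

Max length: 6
Median length: 6
Mean length: 5.5817
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충북본부직할
2nd row: 동청주지사
3rd row: 충북본부직할
4th row: 충북본부직할
5th row: 동청주지사

Common Values

Value | Count | Frequency (%)
충북본부직할 (Chungbuk Headquarters, direct jurisdiction) | 5817 | 58.2%
동청주지사 (East Cheongju Branch) | 4183 | 41.8%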

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; the counts match the Common Values table]

전산화번호 (computerized number)
Text

UNIQUE

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Characters and Unicode

Total characters: 80000
Distinct characters: 27
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: 1872D031
2nd row: 2179P181
3rd row: 2074F271
4th row: 2172P281
5th row: 2178F871

Value | Count | Frequency (%)
1872d031 | 1 | < 0.1%
2075h004 | 1 | < 0.1%
2276s525 | 1 | < 0.1%
2170z151 | 1 | < 0.1%
1678s593 | 1 | < 0.1%
2274w931 | 1 | < 0.1%
2365w711 | 1 | < 0.1%
2064g881 | 1 | < 0.1%
2480c601 | 1 | < 0.1%
2074s811 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
1 | 14185 | 17.7%
2 | 12655 | 15.8%
7 | 12033 | 15.0%
3 | 5289 | 6.6%
6 | 5196 | 6.5%
4 | 4608 | 5.8%
5 | 4369 | 5.5%
8 | 4313 | 5.4%
0 | 4176 | 5.2%
9 | 3989 | 5.0%
Other values (17) | 9187 | 11.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 70813 | 88.5%
Uppercase Letter | 9187 | 11.5%
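
The category breakdown above groups every character by its Unicode general category. A minimal sketch of the same kind of count, using Python's standard `unicodedata` module on the five sample values from this report (the function name is illustrative; this is not the profiling tool's own code):

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category code (e.g. 'Nd', 'Lu')."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# The five sample values listed for 전산화번호.
sample_ids = ["1872D031", "2179P181", "2074F271", "2172P281", "2178F871"]
counts = category_counts(sample_ids)
print(counts)
```

Here `Nd` is the code for Decimal Number and `Lu` the code for Uppercase Letter. Each of the five sample values contributes seven digits and one letter.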

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
X | 609 | 6.6%
E | 607 | 6.6%
Z | 603 | 6.6%
H | 602 | 6.6%
A | 593 | 6.5%
R | 591 | 6.4%
G | 589 | 6.4%
B | 589 | 6.4%
F | 579 | 6.3%
C | 562 | 6.1%
Other values (7) | 3263 | 35.5%

Decimal Number
Value | Count | Frequency (%)
1 | 14185 | 20.0%
2 | 12655 | 17.9%
7 | 12033 | 17.0%
3 | 5289 | 7.5%
6 | 5196 | 7.3%
4 | 4608 | 6.5%
5 | 4369 | 6.2%
8 | 4313 | 6.1%
0 | 4176 | 5.9%
9 | 3989 | 5.6%

Most occurring scripts

Value | Count | Frequency (%)
Common | 70813 | 88.5%
Latin | 9187 | 11.5%

Most frequent character per script

Latin
Value | Count | Frequency (%)
X | 609 | 6.6%
E | 607 | 6.6%
Z | 603 | 6.6%
H | 602 | 6.6%
A | 593 | 6.5%
R | 591 | 6.4%
G | 589 | 6.4%
B | 589 | 6.4%
F | 579 | 6.3%
C | 562 | 6.1%
Other values (7) | 3263 | 35.5%

Common
Value | Count | Frequency (%)
1 | 14185 | 20.0%
2 | 12655 | 17.9%
7 | 12033 | 17.0%
3 | 5289 | 7.5%
6 | 5196 | 7.3%
4 | 4608 | 6.5%
5 | 4369 | 6.2%
8 | 4313 | 6.1%
0 | 4176 | 5.9%
9 | 3989 | 5.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 80000 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 14185 | 17.7%
2 | 12655 | 15.8%
7 | 12033 | 15.0%
3 | 5289 | 6.6%
6 | 5196 | 6.5%
4 | 4608 | 5.8%
5 | 4369 | 5.5%
8 | 4313 | 5.4%
0 | 4176 | 5.2%
9 | 3989 | 5.0%
Other values (17) | 9187 | 11.5%

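지역구분 (area classification)
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
농어촌 (rural area) | 4321
<NA> | 2866
주택가 (residential area) | 2710
번화가 (busy commercial district) | 99
공란 (blank) | 4

Length

Max length: 4
Median length: 3
Mean length: 3.2862
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 주택가
4th row: 주택가
5th row: 주택가

Common Values

Value | Count | Frequency (%)
농어촌 (rural area) | 4321 | 43.2%
<NA> | 2866 | 28.7%
주택가 (residential area) | 2710 | 27.1%
번화가 (busy commercial district) | 99 | 1.0%
공란 (blank) | 4 | < 0.1%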
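
The reported mean length of 3.2862 is reproduced only when "<NA>" is counted as a four-character string. That suggests the column stores "<NA>" as literal text rather than as a true missing value, which would be consistent with the Missing count of 0. The sketch below recomputes the mean from the value counts. Treating "<NA>" and 공란 as placeholders for "no value recorded" is an assumption made here for illustration, not something the report states.

```python
# Value counts copied from the Common Values table for 지역구분.
counts = {"농어촌": 4321, "<NA>": 2866, "주택가": 2710, "번화가": 99, "공란": 4}


def mean_length(value_counts):
    """Mean string length, weighting each value by its number of occurrences."""
    total = sum(value_counts.values())
    return sum(len(value) * n for value, n in value_counts.items()) / total


# Assumption: both of these stand for "no value recorded".
placeholders = {"<NA>", "공란"}
placeholder_share = (
    sum(n for value, n in counts.items() if value in placeholders)
    / sum(counts.values())
)

print(mean_length(counts))
print(f"{placeholder_share:.1%}")
```

Under that assumption, 2870 of the 10000 rows carry no usable area classification, even though the column reports no missing cells.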

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; the counts match the Common Values table]

Correlations

[Figure: correlation heatmap]
Variable | 2차순번 | 2차사업소 | 지역구분
2차순번 | 1.000 | 1.000 | 0.154
2차사업소 | 1.000 | 1.000 | 0.154
지역구분 | 0.154 | 0.154 | 1.000

[Figure: correlation heatmap]
Variable | 지역구분 | 2차사업소 | 2차순번
지역구분 | 1.000 | 0.102 | 0.102
2차사업소 | 0.102 | 1.000 | 1.000
2차순번 | 0.102 | 1.000 | 1.000

[Figure: correlation heatmap]
Variable | 2차순번 | 2차사업소 | 지역구분
2차순번 | 1.000 | 1.000 | 0.102
2차사업소 | 1.000 | 1.000 | 0.102
지역구분 | 0.102 | 0.102 | 1.000
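
The coefficient of 1.000 between 2차순번 and 2차사업소 means that one variable fully determines the other. The extracted text does not name the measures behind the three matrices, so the sketch below uses uncorrected Cramér's V, a common association measure for two categorical variables, purely as an illustration. It also assumes that every row with 2차순번 = 1 belongs to 충북본부직할 and every row with 2차순번 = 2 to 동청주지사, which the identical counts (5817 and 4183) and the sample rows suggest but do not prove.

```python
import math
from collections import Counter


def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (x, y) categorical pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row = Counter(x for x, _ in pairs)
    col = Counter(y for _, y in pairs)
    chi2 = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n
            chi2 += (joint[(x, y)] - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1  # zero when either variable is constant
    return math.sqrt(chi2 / (n * k))


# Assumed one-to-one pairing, reconstructed from the reported counts.
pairs = [("1", "충북본부직할")] * 5817 + [("2", "동청주지사")] * 4183
print(round(cramers_v(pairs), 3))
```

The function divides by zero when either variable has a single distinct value, so it cannot be applied to a constant column. Note that the constant columns 1차순번 and 1차본부 do not appear in the matrices above.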

Missing values

[Figure: nullity by column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

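Key: 충북본부 = Chungbuk Headquarters; 충북본부직할 = Chungbuk Headquarters, direct jurisdiction; 동청주지사 = East Cheongju Branch; 주택가 = residential area; 농어촌 = rural area.

(index) | 1차순번 | 1차본부 | 2차순번 | 2차사업소 | 전산화번호 | 지역구분
49487 | 7 | 충북본부 | 1 | 충북본부직할 | 1872D031 | <NA>
69502 | 7 | 충북본부 | 2 | 동청주지사 | 2179P181 | <NA>
22119 | 7 | 충북본부 | 1 | 충북본부직할 | 2074F271 | 주택가
35913 | 7 | 충북본부 | 1 | 충북본부직할 | 2172P281 | 주택가
70507 | 7 | 충북본부 | 2 | 동청주지사 | 2178F871 | 주택가
47283 | 7 | 충북본부 | 1 | 충북본부직할 | 1974A103 | <NA>
28159 | 7 | 충북본부 | 1 | 충북본부직할 | 2273A121 | 주택가
16661 | 7 | 충북본부 | 1 | 충북본부직할 | 1776P731 | 농어촌
39378 | 7 | 충북본부 | 1 | 충북본부직할 | 2170P792 | 농어촌
237 | 7 | 충북본부 | 1 | 충북본부직할 | 1968Y363 | 농어촌

(index) | 1차순번 | 1차본부 | 2차순번 | 2차사업소 | 전산화번호 | 지역구분
72050 | 7 | 충북본부 | 2 | 동청주지사 | 19803552 | 주택가
41964 | 7 | 충북본부 | 1 | 충북본부직할 | 2272G723 | 농어촌
80112 | 7 | 충북본부 | 2 | 동청주지사 | 2080H201 | <NA>
84744 | 7 | 충북본부 | 2 | 동청주지사 | 2275R591 | 주택가
39069 | 7 | 충북본부 | 1 | 충북본부직할 | 1875P711 | 농어촌
86993 | 7 | 충북본부 | 2 | 동청주지사 | 2371R211 | 농어촌
29882 | 7 | 충북본부 | 1 | 충북본부직할 | 1968C671 | 농어촌
53542 | 7 | 충북본부 | 1 | 충북본부직할 | 1475S871 | <NA>
93521 | 7 | 충북본부 | 2 | 동청주지사 | 3373C772 | <NA>
12441 | 7 | 충북본부 | 1 | 충북본부직할 | 1874H221 | <NA>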