Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows597
Duplicate rows (%)6.0%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

DateTime1
Text2
Categorical1

Dataset

Description한국전기안전공사에서 제공하는 2019년 경상북도 사용전점검 결과 데이터입니다. 점검일자, 주소, 건물유형, 점검결과를 확인하실 수 있습니다.
Author한국전기안전공사
URLhttps://www.data.go.kr/data/15103227/fileData.do

Alerts

Dataset has 597 (6.0%) duplicate rowsDuplicates
결과 is highly imbalanced (55.3%)Imbalance

Reproduction

Analysis started2023-12-12 15:29:14.804631
Analysis finished2023-12-12 15:29:15.337790
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct322
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-01-02 00:00:00
Maximum2019-12-31 00:00:00
2023-12-13T00:29:15.438186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:29:15.596351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

주소
Text

Distinct2376
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:29:15.984042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length15.3538
Min length10

Characters and Unicode

Total characters153538
Distinct characters285
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique640 ?
Unique (%)6.4%

Sample

1st row경상북도 김천시 아포읍 봉산리
2nd row경상북도 군위군 효령면 마시리
3rd row경상북도 김천시 율곡동
4th row경상북도 영주시 고현동
5th row경상북도 경산시 남방동
ValueCountFrequency (%)
경상북도 9896
25.5%
포항시 1136
 
2.9%
구미시 935
 
2.4%
경산시 935
 
2.4%
상주시 835
 
2.2%
김천시 707
 
1.8%
북구 598
 
1.5%
안동시 563
 
1.5%
남구 538
 
1.4%
영주시 480
 
1.2%
Other values (2088) 22143
57.1%
2023-12-13T00:29:16.518780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28766
18.7%
11544
 
7.5%
11118
 
7.2%
10946
 
7.1%
10787
 
7.0%
7742
 
5.0%
6518
 
4.2%
5439
 
3.5%
3728
 
2.4%
3576
 
2.3%
Other values (275) 53374
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 124768
81.3%
Space Separator 28766
 
18.7%
Decimal Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11544
 
9.3%
11118
 
8.9%
10946
 
8.8%
10787
 
8.6%
7742
 
6.2%
6518
 
5.2%
5439
 
4.4%
3728
 
3.0%
3576
 
2.9%
2465
 
2.0%
Other values (272) 50905
40.8%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 2
50.0%
Space Separator
ValueCountFrequency (%)
28766
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 124768
81.3%
Common 28770
 
18.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11544
 
9.3%
11118
 
8.9%
10946
 
8.8%
10787
 
8.6%
7742
 
6.2%
6518
 
5.2%
5439
 
4.4%
3728
 
3.0%
3576
 
2.9%
2465
 
2.0%
Other values (272) 50905
40.8%
Common
ValueCountFrequency (%)
28766
> 99.9%
2 2
 
< 0.1%
1 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 124768
81.3%
ASCII 28770
 
18.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28766
> 99.9%
2 2
 
< 0.1%
1 2
 
< 0.1%
Hangul
ValueCountFrequency (%)
11544
 
9.3%
11118
 
8.9%
10946
 
8.8%
10787
 
8.6%
7742
 
6.2%
6518
 
5.2%
5439
 
4.4%
3728
 
3.0%
3576
 
2.9%
2465
 
2.0%
Other values (272) 50905
40.8%
Distinct54
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:29:16.725176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length4.1184
Min length2

Characters and Unicode

Total characters41184
Distinct characters124
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st rowCCTV
2nd row태양광설비(3년)
3rd row일반음식점
4th row태양광설비(3년)
5th row보안등
ValueCountFrequency (%)
이외시설 1719
16.1%
농사용 1532
14.4%
보안등 1374
12.9%
태양광설비(3년 1284
12.1%
임시 687
 
6.5%
우사 662
 
6.2%
단독 560
 
5.3%
기타 367
 
3.4%
cctv 351
 
3.3%
아파트 309
 
2.9%
Other values (47) 1804
16.9%
2023-12-13T00:29:17.079045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3192
 
7.8%
2589
 
6.3%
2226
 
5.4%
1722
 
4.2%
1719
 
4.2%
1646
 
4.0%
1571
 
3.8%
1532
 
3.7%
1411
 
3.4%
1374
 
3.3%
Other values (114) 22202
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35223
85.5%
Uppercase Letter 1408
 
3.4%
Open Punctuation 1284
 
3.1%
Decimal Number 1284
 
3.1%
Close Punctuation 1284
 
3.1%
Space Separator 649
 
1.6%
Other Punctuation 52
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3192
 
9.1%
2589
 
7.4%
2226
 
6.3%
1722
 
4.9%
1719
 
4.9%
1646
 
4.7%
1571
 
4.5%
1532
 
4.3%
1411
 
4.0%
1374
 
3.9%
Other values (103) 16241
46.1%
Uppercase Letter
ValueCountFrequency (%)
C 704
50.0%
V 351
24.9%
T 351
24.9%
P 2
 
0.1%
Other Punctuation
ValueCountFrequency (%)
/ 32
61.5%
, 18
34.6%
· 2
 
3.8%
Open Punctuation
ValueCountFrequency (%)
( 1284
100.0%
Decimal Number
ValueCountFrequency (%)
3 1284
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1284
100.0%
Space Separator
ValueCountFrequency (%)
649
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35223
85.5%
Common 4553
 
11.1%
Latin 1408
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3192
 
9.1%
2589
 
7.4%
2226
 
6.3%
1722
 
4.9%
1719
 
4.9%
1646
 
4.7%
1571
 
4.5%
1532
 
4.3%
1411
 
4.0%
1374
 
3.9%
Other values (103) 16241
46.1%
Common
ValueCountFrequency (%)
( 1284
28.2%
3 1284
28.2%
) 1284
28.2%
649
14.3%
/ 32
 
0.7%
, 18
 
0.4%
· 2
 
< 0.1%
Latin
ValueCountFrequency (%)
C 704
50.0%
V 351
24.9%
T 351
24.9%
P 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35223
85.5%
ASCII 5959
 
14.5%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3192
 
9.1%
2589
 
7.4%
2226
 
6.3%
1722
 
4.9%
1719
 
4.9%
1646
 
4.7%
1571
 
4.5%
1532
 
4.3%
1411
 
4.0%
1374
 
3.9%
Other values (103) 16241
46.1%
ASCII
ValueCountFrequency (%)
( 1284
21.5%
3 1284
21.5%
) 1284
21.5%
C 704
11.8%
649
10.9%
V 351
 
5.9%
T 351
 
5.9%
/ 32
 
0.5%
, 18
 
0.3%
P 2
 
< 0.1%
None
ValueCountFrequency (%)
· 2
100.0%

결과
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
적합
9069 
부적합
931 

Length

Max length3
Median length2
Mean length2.0931
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적합
2nd row적합
3rd row적합
4th row적합
5th row적합

Common Values

ValueCountFrequency (%)
적합 9069
90.7%
부적합 931
 
9.3%

Length

2023-12-13T00:29:17.220706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:29:17.311134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
적합 9069
90.7%
부적합 931
 
9.3%

Correlations

2023-12-13T00:29:17.370009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건물유형결과
건물유형1.0000.173
결과0.1731.000

Missing values

2023-12-13T00:29:15.205848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:29:15.294481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

점검일자주소건물유형결과
243772019-07-09경상북도 김천시 아포읍 봉산리CCTV적합
325532019-05-10경상북도 군위군 효령면 마시리태양광설비(3년)적합
269452019-09-20경상북도 김천시 율곡동일반음식점적합
423192019-06-24경상북도 영주시 고현동태양광설비(3년)적합
35202019-06-12경상북도 경산시 남방동보안등적합
12892019-03-11경상북도 영천시 대창면 용호리일반음식점적합
309252019-02-26경상북도 구미시 고아읍 오로리보안등적합
111322019-04-23경상북도 영덕군 강구면 금호리태양광설비(3년)적합
430352019-07-10경상북도 예천군 풍양면 낙상리보안등적합
45652019-07-16경상북도 영천시 화남면 온천리보안등적합
점검일자주소건물유형결과
492202019-08-08경상북도 고령군 대가야읍 쾌빈리CCTV적합
287592019-11-29경상북도 김천시 교동농사용적합
182862019-12-31경상북도 영덕군 남정면 부경리아파트적합
191762019-02-07경상북도 상주시 모동면 덕곡리단독적합
494142019-09-07경상북도 고령군 성산면 박곡리기타 조명적합
426282019-06-30경상북도 안동시 풍산읍 괴정리보안등적합
245132019-07-09경상북도 김천시 아포읍 제석리CCTV적합
432012019-07-15경상북도 영주시 단산면 병산리농사용적합
121392019-05-27경상북도 포항시 남구 해도동전기차 충전시설적합
410352019-05-23경상북도 예천군 감천면 수한리CCTV적합

Duplicate rows

Most frequently occurring

점검일자주소건물유형결과# duplicates
5852019-12-23경상북도 포항시 남구 연일읍 자명리아파트적합11
1322019-04-04경북 구미시 해평면 낙성리보안등적합10
1602019-04-24경북 구미시 해평면 도문리보안등적합10
3262019-07-18경상북도 구미시 거의동연립,다세대적합10
152019-01-11경상북도 김천시 부곡동이외시설적합9
1052019-03-13경북 구미시 해평면 해평리보안등적합8
3972019-09-02경상북도 청송군 파천면 송강리태양광설비(3년)적합8
882019-02-22경상북도 구미시 고아읍 오로리보안등적합7
1362019-04-05경북 구미시 해평면 금호리보안등적합7
1582019-04-23경상북도 영주시 가흥동아파트적합7