Overview

Dataset statistics

Number of variables4
Number of observations60
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory37.2 B

Variable types

Categorical2
Numeric2

Dataset

Description한국전기안전공사에서 최근 3년 전기안전점검 대상(청소년수련시설, 비디오시청제공업, 게임제공업, 노래연습장, 단란주점, 유흥주점, 어린이집, 공연장, 종합병원, 호텔업, 숙박업, 농어촌민박, 자가용Ev충전시설, 기타) 및 불합격 건수를 확인 할 수 있는 데이터입니다.
URLhttps://www.data.go.kr/data/15044358/fileData.do

Alerts

점검대상 is highly overall correlated with 불합격High correlation
불합격 is highly overall correlated with 점검대상High correlation
점검대상 has 4 (6.7%) zerosZeros
불합격 has 9 (15.0%) zerosZeros

Reproduction

Analysis started2023-12-12 20:08:00.074886
Analysis finished2023-12-12 20:08:00.771550
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct24
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
청소년수련시설
 
4
노래연습장
 
4
단란주점
 
4
유흥주점
 
4
어린이집
 
4
Other values (19)
40 

Length

Max length13
Median length10
Mean length4.85
Min length2

Unique

Unique10 ?
Unique (%)16.7%

Sample

1st row청소년수련시설
2nd row비디오시청제공업
3rd row게임제공업
4th row노래연습장
5th row단란주점

Common Values

ValueCountFrequency (%)
청소년수련시설 4
 
6.7%
노래연습장 4
 
6.7%
단란주점 4
 
6.7%
유흥주점 4
 
6.7%
어린이집 4
 
6.7%
공연장 4
 
6.7%
종합병원 4
 
6.7%
기타 4
 
6.7%
숙박업 3
 
5.0%
게임제공업 3
 
5.0%
Other values (14) 22
36.7%

Length

2023-12-13T05:08:00.851721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
청소년수련시설 4
 
6.3%
단란주점 4
 
6.3%
유흥주점 4
 
6.3%
어린이집 4
 
6.3%
공연장 4
 
6.3%
종합병원 4
 
6.3%
기타 4
 
6.3%
노래연습장 4
 
6.3%
자가용 3
 
4.8%
농어촌민박 3
 
4.8%
Other values (15) 25
39.7%

연도
Categorical

Distinct4
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
2022
18 
2019
14 
2020
14 
2021
14 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2022 18
30.0%
2019 14
23.3%
2020 14
23.3%
2021 14
23.3%

Length

2023-12-13T05:08:00.978463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:08:01.074831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 18
30.0%
2019 14
23.3%
2020 14
23.3%
2021 14
23.3%

점검대상
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct56
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1605.8
Minimum0
Maximum33383
Zeros4
Zeros (%)6.7%
Negative0
Negative (%)0.0%
Memory size672.0 B
2023-12-13T05:08:01.189052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q158.5
median285.5
Q3998.5
95-th percentile6376.35
Maximum33383
Range33383
Interquartile range (IQR)940

Descriptive statistics

Standard deviation4571.4448
Coefficient of variation (CV)2.8468332
Kurtosis40.787938
Mean1605.8
Median Absolute Deviation (MAD)269.5
Skewness5.9795515
Sum96348
Variance20898108
MonotonicityNot monotonic
2023-12-13T05:08:01.337901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4
 
6.7%
16 2
 
3.3%
152 1
 
1.7%
930 1
 
1.7%
124 1
 
1.7%
328 1
 
1.7%
133 1
 
1.7%
1931 1
 
1.7%
1027 1
 
1.7%
5481 1
 
1.7%
Other values (46) 46
76.7%
ValueCountFrequency (%)
0 4
6.7%
1 1
 
1.7%
2 1
 
1.7%
3 1
 
1.7%
15 1
 
1.7%
16 2
3.3%
18 1
 
1.7%
25 1
 
1.7%
31 1
 
1.7%
39 1
 
1.7%
ValueCountFrequency (%)
33383 1
1.7%
7368 1
1.7%
6497 1
1.7%
6370 1
1.7%
6171 1
1.7%
5481 1
1.7%
4702 1
1.7%
4067 1
1.7%
3419 1
1.7%
1931 1
1.7%

불합격
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct41
Distinct (%)68.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98.4
Minimum0
Maximum1307
Zeros9
Zeros (%)15.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2023-12-13T05:08:01.740395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14.75
median20
Q3102.75
95-th percentile459.05
Maximum1307
Range1307
Interquartile range (IQR)98

Descriptive statistics

Standard deviation218.34277
Coefficient of variation (CV)2.2189306
Kurtosis17.510831
Mean98.4
Median Absolute Deviation (MAD)19
Skewness3.919328
Sum5904
Variance47673.566
MonotonicityNot monotonic
2023-12-13T05:08:01.880517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
0 9
 
15.0%
10 4
 
6.7%
1 4
 
6.7%
5 3
 
5.0%
38 2
 
3.3%
16 2
 
3.3%
21 2
 
3.3%
13 1
 
1.7%
286 1
 
1.7%
1307 1
 
1.7%
Other values (31) 31
51.7%
ValueCountFrequency (%)
0 9
15.0%
1 4
6.7%
2 1
 
1.7%
4 1
 
1.7%
5 3
 
5.0%
7 1
 
1.7%
10 4
6.7%
13 1
 
1.7%
15 1
 
1.7%
16 2
 
3.3%
ValueCountFrequency (%)
1307 1
1.7%
810 1
1.7%
650 1
1.7%
449 1
1.7%
303 1
1.7%
286 1
1.7%
225 1
1.7%
162 1
1.7%
159 1
1.7%
154 1
1.7%

Interactions

2023-12-13T05:08:00.444146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:08:00.243275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:08:00.539222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:08:00.347521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:08:01.965566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분연도점검대상불합격
구분1.0000.0000.4560.000
연도0.0001.0000.0000.000
점검대상0.4560.0001.0000.919
불합격0.0000.0000.9191.000
2023-12-13T05:08:02.065471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분연도
구분1.0000.000
연도0.0001.000
2023-12-13T05:08:02.143098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
점검대상불합격구분연도
점검대상1.0000.9440.1670.000
불합격0.9441.0000.0000.000
구분0.1670.0001.0000.000
연도0.0000.0000.0001.000

Missing values

2023-12-13T05:08:00.661103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:08:00.738009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분연도점검대상불합격
0청소년수련시설20191525
1비디오시청제공업2019311
2게임제공업20196497303
3노래연습장20191687162
4단란주점201940425
5유흥주점201940246
6어린이집20191334101
7공연장20197719
8종합병원201936016
9호텔업201913921
구분연도점검대상불합격
50유치원2022714
51문화재2022160
52공연장20228713
53영화상영관2022251
54대규모점포202230
55종합병원20222665
56호텔202212215
57카지노202220
58국제회의장202210
59기타2022333831307