Overview

Dataset statistics

Number of variables6
Number of observations422
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.7 KiB
Average record size in memory50.3 B

Variable types

DateTime2
Numeric2
Categorical2

Dataset

Description한국중부발전(주)의 견학신청 현황 정보이며, 목록은 '등록일시', '견학일자', '신청 순번', '시설물', '사용희망인원수', '견학 유형'으로 이루어져 있음.
URLhttps://www.data.go.kr/data/15119088/fileData.do

Alerts

사용희망인원수 is highly overall correlated with 견학 유형High correlation
견학 유형 is highly overall correlated with 사용희망인원수High correlation
시설물 is highly imbalanced (51.4%)Imbalance
신청 순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:06:32.485025
Analysis finished2023-12-12 23:06:33.310819
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct200
Distinct (%)47.4%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
Minimum2022-07-01 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T08:06:33.388321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:06:33.554633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct200
Distinct (%)47.4%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
Minimum2022-07-08 00:00:00
Maximum2023-10-26 00:00:00
2023-12-13T08:06:33.721740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:06:33.858230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

신청 순번
Real number (ℝ)

UNIQUE 

Distinct422
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11279.936
Minimum10954
Maximum11600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.8 KiB
2023-12-13T08:06:33.987443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10954
5-th percentile10976.05
Q111120.25
median11280.5
Q311442.75
95-th percentile11559.95
Maximum11600
Range646
Interquartile range (IQR)322.5

Descriptive statistics

Standard deviation184.65633
Coefficient of variation (CV)0.016370335
Kurtosis-1.1738123
Mean11279.936
Median Absolute Deviation (MAD)161.5
Skewness0.015189296
Sum4760133
Variance34097.96
MonotonicityStrictly increasing
2023-12-13T08:06:34.137270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10954 1
 
0.2%
11359 1
 
0.2%
11387 1
 
0.2%
11386 1
 
0.2%
11385 1
 
0.2%
11384 1
 
0.2%
11383 1
 
0.2%
11382 1
 
0.2%
11381 1
 
0.2%
11380 1
 
0.2%
Other values (412) 412
97.6%
ValueCountFrequency (%)
10954 1
0.2%
10955 1
0.2%
10956 1
0.2%
10957 1
0.2%
10958 1
0.2%
10959 1
0.2%
10960 1
0.2%
10961 1
0.2%
10962 1
0.2%
10963 1
0.2%
ValueCountFrequency (%)
11600 1
0.2%
11599 1
0.2%
11596 1
0.2%
11595 1
0.2%
11594 1
0.2%
11593 1
0.2%
11592 1
0.2%
11591 1
0.2%
11590 1
0.2%
11589 1
0.2%

시설물
Categorical

IMBALANCE 

Distinct7
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
서울발전본부
321 
제주발전본부
 
27
세종발전본부
 
25
신보령발전본부
 
18
인천발전본부
 
16
Other values (2)
 
15

Length

Max length15
Median length6
Mean length6.1350711
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row세종발전본부
2nd row제주발전본부
3rd row인천발전본부
4th row인천발전본부
5th row보령에너지월드(보령화력본부)

Common Values

ValueCountFrequency (%)
서울발전본부 321
76.1%
제주발전본부 27
 
6.4%
세종발전본부 25
 
5.9%
신보령발전본부 18
 
4.3%
인천발전본부 16
 
3.8%
신서천발전본부 12
 
2.8%
보령에너지월드(보령화력본부) 3
 
0.7%

Length

2023-12-13T08:06:34.266937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:06:34.376168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울발전본부 321
76.1%
제주발전본부 27
 
6.4%
세종발전본부 25
 
5.9%
신보령발전본부 18
 
4.3%
인천발전본부 16
 
3.8%
신서천발전본부 12
 
2.8%
보령에너지월드(보령화력본부 3
 
0.7%

사용희망인원수
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.3578199
Minimum0
Maximum38
Zeros2
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size3.8 KiB
2023-12-13T08:06:34.491247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13.25
median9
Q315
95-th percentile20
Maximum38
Range38
Interquartile range (IQR)11.75

Descriptive statistics

Standard deviation6.4153485
Coefficient of variation (CV)0.68556016
Kurtosis-0.2235109
Mean9.3578199
Median Absolute Deviation (MAD)6
Skewness0.4275253
Sum3949
Variance41.156696
MonotonicityNot monotonic
2023-12-13T08:06:34.624816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
15 98
23.2%
1 53
12.6%
20 36
 
8.5%
5 34
 
8.1%
3 28
 
6.6%
4 23
 
5.5%
2 23
 
5.5%
10 20
 
4.7%
8 17
 
4.0%
12 16
 
3.8%
Other values (13) 74
17.5%
ValueCountFrequency (%)
0 2
 
0.5%
1 53
12.6%
2 23
5.5%
3 28
6.6%
4 23
5.5%
5 34
8.1%
6 13
 
3.1%
7 12
 
2.8%
8 17
 
4.0%
9 13
 
3.1%
ValueCountFrequency (%)
38 1
 
0.2%
31 1
 
0.2%
23 2
 
0.5%
20 36
 
8.5%
19 1
 
0.2%
18 1
 
0.2%
17 1
 
0.2%
15 98
23.2%
14 9
 
2.1%
13 7
 
1.7%

견학 유형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
기관
342 
개인
80 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기관
2nd row기관
3rd row개인
4th row개인
5th row기관

Common Values

ValueCountFrequency (%)
기관 342
81.0%
개인 80
 
19.0%

Length

2023-12-13T08:06:34.769548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:06:34.864679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기관 342
81.0%
개인 80
 
19.0%

Interactions

2023-12-13T08:06:32.862244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:06:32.651626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:06:32.980425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:06:32.749738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:06:34.932814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청 순번시설물사용희망인원수견학 유형
신청 순번1.0000.3190.1940.266
시설물0.3191.0000.6140.258
사용희망인원수0.1940.6141.0000.614
견학 유형0.2660.2580.6141.000
2023-12-13T08:06:35.026650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설물견학 유형
시설물1.0000.274
견학 유형0.2741.000
2023-12-13T08:06:35.128049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청 순번사용희망인원수시설물견학 유형
신청 순번1.0000.0520.1690.200
사용희망인원수0.0521.0000.3830.614
시설물0.1690.3831.0000.274
견학 유형0.2000.6140.2741.000

Missing values

2023-12-13T08:06:33.135661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:06:33.251769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록일시견학일자신청 순번시설물사용희망인원수견학 유형
02022-07-012022-07-0810954세종발전본부10기관
12022-07-012022-07-0810955제주발전본부23기관
22022-07-042022-08-0810956인천발전본부3개인
32022-07-042022-08-0810957인천발전본부3개인
42022-07-052022-07-1510958보령에너지월드(보령화력본부)38기관
52022-07-052022-07-2810959제주발전본부23기관
62022-07-062022-07-2310960보령에너지월드(보령화력본부)4개인
72022-07-132022-07-2210961세종발전본부8기관
82022-07-142022-08-1110962세종발전본부10기관
92022-07-152022-08-2610963세종발전본부15기관
등록일시견학일자신청 순번시설물사용희망인원수견학 유형
4122023-06-272023-07-1111589제주발전본부4기관
4132023-06-272023-07-0611590서울발전본부15기관
4142023-06-282023-07-0611591서울발전본부15기관
4152023-06-282023-10-1311592인천발전본부5기관
4162023-06-282023-08-0411593서울발전본부12기관
4172023-06-282023-08-0411594서울발전본부4기관
4182023-06-292023-06-2911595서울발전본부4기관
4192023-06-292023-06-2911596서울발전본부2기관
4202023-06-292023-07-0411599서울발전본부6기관
4212023-06-302023-07-1011600서울발전본부2기관