Overview

Dataset statistics

Number of variables: 8
Number of observations: 8657
Missing cells: 4060
Missing cells (%): 5.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 600.4 KiB
Average record size in memory: 71.0 B
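
The two derived figures in this block can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over every cell (variables × observations) and that 1 KiB is 1024 bytes; neither assumption is stated in the report itself.

```python
# Cross-check the derived dataset statistics from the raw counts.
n_variables = 8
n_observations = 8657
missing_cells = 4060
total_size_kib = 600.4

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both values round to the figures reported above, which suggests the assumptions hold for this report.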

Variable types

Numeric: 5
Categorical: 3

Dataset

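Description: Per-exam test-site classroom information for the national technical qualification exams of the Korea Internet & Security Agency (KISQ: 정보보안기사 / 정보보안산업기사, i.e. Engineer Information Security / Industrial Engineer Information Security).
Author: 한국인터넷진흥원 (Korea Internet & Security Agency)
URL: https://www.data.go.kr/data/15092540/fileData.do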

Alerts

시험회차 is highly overall correlated with 인원수 (High correlation)
인원수 is highly overall correlated with 시험회차 (High correlation)
건물명 is highly imbalanced (60.2%) (Imbalance)
장애인좌석수 is highly imbalanced (94.9%) (Imbalance)
추가좌석수 has 4060 (46.9%) missing values (Missing)
추가좌석수 has 454 (5.2%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 20:20:57.586117
Analysis finished: 2023-12-12 20:21:01.183957
Duration: 3.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
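
A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the configuration actually used (that lives in config.json): the function name `build_report`, the CSV path, the output path, the report title, and the CP949 encoding are all assumptions made for illustration.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile a CSV file and write the HTML report to out_path."""
    # Imported lazily so this module loads even without the third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is CP949-encoded; change this if it is UTF-8.
    df = pd.read_csv(csv_path, encoding="cp949")

    report = ProfileReport(
        df,
        title="Exam classroom profile",  # hypothetical title
        # Dataset metadata, mirroring the Author and URL fields shown above.
        dataset={
            "author": "한국인터넷진흥원",
            "url": "https://www.data.go.kr/data/15092540/fileData.do",
        },
    )
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("exam_rooms.csv")` (a hypothetical file name) would write `report.html` next to the script. Timings and memory figures will differ from run to run.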

Variables

시험회차
Real number (ℝ)

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.2687998
Minimum: 1
Maximum: 18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 76.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
Median: 9
Q3: 14
95-th percentile: 17
Maximum: 18
Range: 17
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 5.4672769
Coefficient of variation (CV): 0.58985812
Kurtosis: -1.3217338
Mean: 9.2687998
Median Absolute Deviation (MAD): 5
Skewness: -0.049257404
Sum: 80240
Variance: 29.891116
Monotonicity: Decreasing
Histogram with fixed size bins (bins=18)

Common values

Value | Count | Frequency (%)
2 | 978 | 11.3%
1 | 679 | 7.8%
15 | 632 | 7.3%
16 | 534 | 6.2%
17 | 529 | 6.1%
9 | 488 | 5.6%
14 | 474 | 5.5%
8 | 452 | 5.2%
13 | 443 | 5.1%
10 | 441 | 5.1%
Other values (8) | 3007 | 34.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 679 | 7.8%
2 | 978 | 11.3%
3 | 311 | 3.6%
4 | 318 | 3.7%
5 | 339 | 3.9%
6 | 372 | 4.3%
7 | 432 | 5.0%
8 | 452 | 5.2%
9 | 488 | 5.6%
10 | 441 | 5.1%

Maximum 10 values

Value | Count | Frequency (%)
18 | 372 | 4.3%
17 | 529 | 6.1%
16 | 534 | 6.2%
15 | 632 | 7.3%
14 | 474 | 5.5%
13 | 443 | 5.1%
12 | 430 | 5.0%
11 | 433 | 5.0%
10 | 441 | 5.1%
9 | 488 | 5.6%
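
Because the two value-ordered tables for 시험회차 together cover all 18 distinct values, the "Other values (8)" row and the reported Sum can be rebuilt from them. The sketch below does this with the standard library only; treating the common-values table as a plain top-10 by count is an assumption about how the profiler builds it.

```python
from collections import Counter

# Value -> count for 시험회차, merged from the two value-ordered tables above.
counts = Counter({
    1: 679, 2: 978, 3: 311, 4: 318, 5: 339, 6: 372,
    7: 432, 8: 452, 9: 488, 10: 441, 11: 433, 12: 430,
    13: 443, 14: 474, 15: 632, 16: 534, 17: 529, 18: 372,
})

n_rows = sum(counts.values())

# Assumption: the common-values table is the ten most frequent values.
top10 = counts.most_common(10)
top10_total = sum(count for _, count in top10)

# Everything outside the top ten is folded into one "Other values" row.
other_distinct = len(counts) - len(top10)
other_total = n_rows - top10_total

# Sum of the column, weighting each value by its count.
value_sum = sum(value * count for value, count in counts.items())

print(n_rows, other_distinct, other_total, value_sum)
```

The four printed numbers can be compared against the number of observations, the "Other values (8)" row, and the Sum statistic reported for this variable.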

시험장일련번호
Real number (ℝ)

Distinct: 73
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.522583
Minimum: 1
Maximum: 73
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 76.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 6
Median: 17
Q3: 31
95-th percentile: 56
Maximum: 73
Range: 72
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 16.987702
Coefficient of variation (CV): 0.82775651
Kurtosis: 0.10016115
Mean: 20.522583
Median Absolute Deviation (MAD): 12
Skewness: 0.90659299
Sum: 177664
Variance: 288.582
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
3 | 479 | 5.5%
1 | 409 | 4.7%
4 | 383 | 4.4%
2 | 365 | 4.2%
5 | 364 | 4.2%
7 | 342 | 4.0%
6 | 314 | 3.6%
9 | 308 | 3.6%
8 | 270 | 3.1%
11 | 231 | 2.7%
Other values (63) | 5192 | 60.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 409 | 4.7%
2 | 365 | 4.2%
3 | 479 | 5.5%
4 | 383 | 4.4%
5 | 364 | 4.2%
6 | 314 | 3.6%
7 | 342 | 4.0%
8 | 270 | 3.1%
9 | 308 | 3.6%
10 | 216 | 2.5%

Maximum 10 values

Value | Count | Frequency (%)
73 | 9 | 0.1%
72 | 10 | 0.1%
71 | 10 | 0.1%
70 | 4 | < 0.1%
69 | 15 | 0.2%
68 | 16 | 0.2%
67 | 30 | 0.3%
66 | 34 | 0.4%
65 | 40 | 0.5%
64 | 24 | 0.3%

교실번호
Real number (ℝ)

Distinct: 43
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.90782
Minimum: 1
Maximum: 43
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 76.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
Median: 9
Q3: 16
95-th percentile: 28
Maximum: 43
Range: 42
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 8.60794
Coefficient of variation (CV): 0.78915309
Kurtosis: 0.26034713
Mean: 10.90782
Median Absolute Deviation (MAD): 6
Skewness: 0.94287192
Sum: 94429
Variance: 74.096631
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=43)

Common values

Value | Count | Frequency (%)
1 | 696 | 8.0%
2 | 680 | 7.9%
3 | 635 | 7.3%
4 | 577 | 6.7%
5 | 493 | 5.7%
6 | 440 | 5.1%
7 | 395 | 4.6%
8 | 366 | 4.2%
9 | 336 | 3.9%
10 | 320 | 3.7%
Other values (33) | 3719 | 43.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 696 | 8.0%
2 | 680 | 7.9%
3 | 635 | 7.3%
4 | 577 | 6.7%
5 | 493 | 5.7%
6 | 440 | 5.1%
7 | 395 | 4.6%
8 | 366 | 4.2%
9 | 336 | 3.9%
10 | 320 | 3.7%

Maximum 10 values

Value | Count | Frequency (%)
43 | 1 | < 0.1%
42 | 4 | < 0.1%
41 | 8 | 0.1%
40 | 12 | 0.1%
39 | 12 | 0.1%
38 | 13 | 0.2%
37 | 14 | 0.2%
36 | 18 | 0.2%
35 | 24 | 0.3%
34 | 28 | 0.3%

인원수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 44
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25.927804
Minimum: 1
Maximum: 90
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 76.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 18
Q1: 23
Median: 25
Q3: 30
95-th percentile: 33
Maximum: 90
Range: 89
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 5.3744909
Coefficient of variation (CV): 0.20728677
Kurtosis: 10.30925
Mean: 25.927804
Median Absolute Deviation (MAD): 5
Skewness: 1.3612971
Sum: 224457
Variance: 28.885152
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=44)

Common values

Value | Count | Frequency (%)
30 | 2459 | 28.4%
25 | 1314 | 15.2%
23 | 768 | 8.9%
19 | 545 | 6.3%
24 | 484 | 5.6%
22 | 408 | 4.7%
20 | 406 | 4.7%
18 | 315 | 3.6%
29 | 306 | 3.5%
28 | 264 | 3.0%
Other values (34) | 1388 | 16.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 6 | 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
5 | 1 | < 0.1%
12 | 2 | < 0.1%
13 | 3 | < 0.1%
14 | 8 | 0.1%
15 | 18 | 0.2%
16 | 72 | 0.8%
17 | 96 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
90 | 1 | < 0.1%
72 | 7 | 0.1%
70 | 2 | < 0.1%
60 | 11 | 0.1%
56 | 1 | < 0.1%
54 | 5 | 0.1%
50 | 4 | < 0.1%
49 | 10 | 0.1%
48 | 12 | 0.1%
45 | 6 | 0.1%

건물명
Categorical

IMBALANCE 

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 67.8 KiB

Value | Count
정보없음 | 5618
본관 | 2041
숭인관 | 212
1학년관 | 175
3학년관 | 134
Other values (12) | 477

Length

Max length: 12
Median length: 4
Mean length: 3.4894305
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1학년관
2nd row: 1학년관
3rd row: 1학년관
4th row: 1학년관
5th row: 1학년관

Common Values

Value | Count | Frequency (%)
정보없음 | 5618 | 64.9%
본관 | 2041 | 23.6%
숭인관 | 212 | 2.4%
1학년관 | 175 | 2.0%
3학년관 | 134 | 1.5%
2학년관 | 110 | 1.3%
성실관 | 109 | 1.3%
북두관 | 74 | 0.9%
후관 | 56 | 0.6%
봉사관 | 42 | 0.5%
Other values (7) | 86 | 1.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
정보없음 | 5618 | 64.9%
본관 | 2041 | 23.6%
숭인관 | 212 | 2.4%
1학년관 | 175 | 2.0%
3학년관 | 134 | 1.5%
2학년관 | 110 | 1.3%
성실관 | 109 | 1.3%
북두관 | 74 | 0.9%
후관 | 56 | 0.6%
봉사관 | 42 | 0.5%
Other values (6) | 86 | 1.0%

층명
Categorical

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 67.8 KiB

Value | Count
3 | 2619
2 | 2538
4 | 1416
<NA> | 1318
1 | 414

Length

Max length: 4
Median length: 1
Mean length: 1.4567402
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 3
3rd row: 3
4th row: 2
5th row: 2

Common Values

Value | Count | Frequency (%)
3 | 2619 | 30.3%
2 | 2538 | 29.3%
4 | 1416 | 16.4%
<NA> | 1318 | 15.2%
1 | 414 | 4.8%
5 | 352 | 4.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
3 | 2619 | 30.3%
2 | 2538 | 29.3%
4 | 1416 | 16.4%
na | 1318 | 15.2%
1 | 414 | 4.8%
5 | 352 | 4.1%

추가좌석수
Real number (ℝ)

MISSING  ZEROS 

Distinct: 6
Distinct (%): 0.1%
Missing: 4060
Missing (%): 46.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.5427453
Minimum: 0
Maximum: 5
Zeros: 454
Zeros (%): 5.2%
Negative: 0
Negative (%): 0.0%
Memory size: 76.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 3
Q3: 4
95-th percentile: 4
Maximum: 5
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.2871841
Coefficient of variation (CV): 0.50621828
Kurtosis: -0.67514782
Mean: 2.5427453
Median Absolute Deviation (MAD): 1
Skewness: -0.52612979
Sum: 11689
Variance: 1.656843
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)

Common values

Value | Count | Frequency (%)
3 | 1462 | 16.9%
4 | 1176 | 13.6%
2 | 902 | 10.4%
1 | 555 | 6.4%
0 | 454 | 5.2%
5 | 48 | 0.6%
(Missing) | 4060 | 46.9%

Minimum 10 values

Value | Count | Frequency (%)
0 | 454 | 5.2%
1 | 555 | 6.4%
2 | 902 | 10.4%
3 | 1462 | 16.9%
4 | 1176 | 13.6%
5 | 48 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
5 | 48 | 0.6%
4 | 1176 | 13.6%
3 | 1462 | 16.9%
2 | 902 | 10.4%
1 | 555 | 6.4%
0 | 454 | 5.2%
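
Since 추가좌석수 has only six distinct non-missing values, its summary statistics can be recomputed directly from the value counts. The sketch below uses Python's `statistics` module and assumes the profiler reports the sample variance (n - 1 denominator) and ignores missing entries; both assumptions are inferred from the numbers, not stated in the report.

```python
import statistics

# Value -> count for the non-missing 추가좌석수 entries, from the tables above.
counts = {0: 454, 1: 555, 2: 902, 3: 1462, 4: 1176, 5: 48}

# Expand the frequency table back into individual observations.
values = [value for value, count in counts.items() for _ in range(count)]

n = len(values)                          # non-missing observations
total = sum(values)                      # "Sum"
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)

print(n, total, median)
print(f"mean={mean:.7f} variance={variance:.6f} std={std:.7f} cv={cv:.6f}")
```

The non-missing count plus the 4060 missing entries gives the 8657 observations of the dataset, and the recomputed values can be compared line by line with the statistics listed for this variable.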

장애인좌석수
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 67.8 KiB

Value | Count
0 | 8549
1 | 98
2 | 9
3 | 1

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 8549 | 98.8%
1 | 98 | 1.1%
2 | 9 | 0.1%
3 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 8549 | 98.8%
1 | 98 | 1.1%
2 | 9 | 0.1%
3 | 1 | < 0.1%
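
The 94.9% imbalance reported for 장애인좌석수 in the Alerts section is consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the category frequencies and k the number of categories. That the profiler uses exactly this formula is an assumption here, checked only against the counts of this variable.

```python
import math

# Category -> count for 장애인좌석수, from the Common Values table.
counts = {"0": 8549, "1": 98, "2": 9, "3": 1}

n = sum(counts.values())
probabilities = [count / n for count in counts.values()]

# Shannon entropy of the category distribution, in bits.
entropy = -sum(p * math.log2(p) for p in probabilities)

# Assumed score: 1 minus entropy relative to the maximum for k categories.
# 0 means perfectly balanced, values near 1 mean one category dominates.
imbalance = 1 - entropy / math.log2(len(counts))

print(f"Imbalance: {100 * imbalance:.1f}%")
```

The same check cannot be repeated exactly for 건물명, because seven of its 17 categories are only reported in aggregate.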

Interactions

[Figures: 25 pairwise interaction plots (Matplotlib); images not included]

Correlations

Variable | 시험회차 | 시험장일련번호 | 교실번호 | 인원수 | 건물명 | 층명 | 추가좌석수 | 장애인좌석수
시험회차 | 1.000 | 0.685 | 0.165 | 0.493 | 0.578 | 0.189 | 0.434 | 0.056
시험장일련번호 | 0.685 | 1.000 | 0.319 | 0.343 | 0.380 | 0.293 | 0.229 | 0.016
교실번호 | 0.165 | 0.319 | 1.000 | 0.100 | 0.202 | 0.736 | 0.207 | 0.128
인원수 | 0.493 | 0.343 | 0.100 | 1.000 | 0.347 | 0.209 | 0.234 | 0.352
건물명 | 0.578 | 0.380 | 0.202 | 0.347 | 1.000 | 0.405 | 0.306 | 0.000
층명 | 0.189 | 0.293 | 0.736 | 0.209 | 0.405 | 1.000 | 0.188 | 0.068
추가좌석수 | 0.434 | 0.229 | 0.207 | 0.234 | 0.306 | 0.188 | 1.000 | 0.069
장애인좌석수 | 0.056 | 0.016 | 0.128 | 0.352 | 0.000 | 0.068 | 0.069 | 1.000

Variable | 건물명 | 장애인좌석수 | 층명
건물명 | 1.000 | 0.000 | 0.222
장애인좌석수 | 0.000 | 1.000 | 0.056
층명 | 0.222 | 0.056 | 1.000

Variable | 시험회차 | 시험장일련번호 | 교실번호 | 인원수 | 추가좌석수 | 건물명 | 층명 | 장애인좌석수
시험회차 | 1.000 | -0.308 | -0.043 | -0.740 | -0.141 | 0.267 | 0.080 | 0.034
시험장일련번호 | -0.308 | 1.000 | -0.187 | 0.156 | -0.102 | 0.158 | 0.126 | 0.010
교실번호 | -0.043 | -0.187 | 1.000 | 0.078 | 0.185 | 0.080 | 0.393 | 0.077
인원수 | -0.740 | 0.156 | 0.078 | 1.000 | 0.135 | 0.146 | 0.142 | 0.232
추가좌석수 | -0.141 | -0.102 | 0.185 | 0.135 | 1.000 | 0.156 | 0.128 | 0.028
건물명 | 0.267 | 0.158 | 0.080 | 0.146 | 0.156 | 1.000 | 0.222 | 0.000
층명 | 0.080 | 0.126 | 0.393 | 0.142 | 0.128 | 0.222 | 1.000 | 0.056
장애인좌석수 | 0.034 | 0.010 | 0.077 | 0.232 | 0.028 | 0.000 | 0.056 | 1.000
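
The single "High correlation" alert (시험회차 and 인원수) matches what one gets by scanning the last matrix above for off-diagonal entries of large magnitude. The sketch below does that scan; the 0.5 cut-off is an assumption chosen for illustration, since the report does not state the threshold it applied.

```python
# Variable order and values copied from the last correlation matrix above.
names = ["시험회차", "시험장일련번호", "교실번호", "인원수",
         "추가좌석수", "건물명", "층명", "장애인좌석수"]
matrix = [
    [1.000, -0.308, -0.043, -0.740, -0.141, 0.267, 0.080, 0.034],
    [-0.308, 1.000, -0.187, 0.156, -0.102, 0.158, 0.126, 0.010],
    [-0.043, -0.187, 1.000, 0.078, 0.185, 0.080, 0.393, 0.077],
    [-0.740, 0.156, 0.078, 1.000, 0.135, 0.146, 0.142, 0.232],
    [-0.141, -0.102, 0.185, 0.135, 1.000, 0.156, 0.128, 0.028],
    [0.267, 0.158, 0.080, 0.146, 0.156, 1.000, 0.222, 0.000],
    [0.080, 0.126, 0.393, 0.142, 0.128, 0.222, 1.000, 0.056],
    [0.034, 0.010, 0.077, 0.232, 0.028, 0.000, 0.056, 1.000],
]

THRESHOLD = 0.5  # assumption: not stated anywhere in the report

# Visit each unordered pair once (upper triangle) and keep the strong ones.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

for left, right, value in high_pairs:
    print(f"{left} vs {right}: {value:+.3f}")
```

With this cut-off the scan returns only the 시험회차 and 인원수 pair, with a negative sign: later exam rounds tend to have fewer people per classroom.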

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

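First rows

Index | 시험회차 | 시험장일련번호 | 교실번호 | 인원수 | 건물명 | 층명 | 추가좌석수 | 장애인좌석수
0 | 18 | 24 | 6 | 17 | 1학년관 | 3 | 2 | 0
1 | 18 | 24 | 5 | 17 | 1학년관 | 3 | 2 | 0
2 | 18 | 24 | 4 | 17 | 1학년관 | 3 | 2 | 0
3 | 18 | 24 | 3 | 17 | 1학년관 | 2 | 2 | 0
4 | 18 | 24 | 2 | 17 | 1학년관 | 2 | 2 | 0
5 | 18 | 24 | 1 | 18 | 1학년관 | 2 | 3 | 0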
6 | 18 | 23 | 6 | 19 | 정보없음 | 3 | 4 | 0
7 | 18 | 23 | 5 | 19 | 정보없음 | 3 | 4 | 0
8 | 18 | 23 | 4 | 19 | 정보없음 | 3 | 4 | 0
9 | 18 | 23 | 3 | 20 | 정보없음 | 3 | 0 | 0
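
Last rows

Index | 시험회차 | 시험장일련번호 | 교실번호 | 인원수 | 건물명 | 층명 | 추가좌석수 | 장애인좌석수
8647 | 1 | 1 | 10 | 30 | 정보없음 | <NA> | <NA> | 0
8648 | 1 | 1 | 9 | 30 | 정보없음 | <NA> | <NA> | 0
8649 | 1 | 1 | 8 | 30 | 정보없음 | <NA> | <NA> | 0
8650 | 1 | 1 | 7 | 30 | 정보없음 | <NA> | <NA> | 0
8651 | 1 | 1 | 6 | 30 | 정보없음 | <NA> | <NA> | 0
8652 | 1 | 1 | 5 | 30 | 정보없음 | <NA> | <NA> | 0
8653 | 1 | 1 | 4 | 30 | 정보없음 | <NA> | <NA> | 0
8654 | 1 | 1 | 3 | 30 | 정보없음 | <NA> | <NA> | 0
8655 | 1 | 1 | 2 | 30 | 정보없음 | <NA> | <NA> | 0
8656 | 1 | 1 | 1 | 30 | 정보없음 | <NA> | <NA> | 0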