Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 712.9 KiB
Average record size in memory: 73.0 B
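
The average record size follows from the total memory footprint and the row count. A minimal sketch of that arithmetic, assuming the report uses binary units (1 KiB = 1024 bytes); the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" block.
total_size_kib = 712.9
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes; average record size = total bytes / rows.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")  # prints "73.0 B"
```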

Variable types

Categorical: 6
DateTime: 1
Numeric: 1

Dataset

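Description: Information on people who obtained aviation personnel licenses (pilot, air traffic controller, aircraft maintenance engineer, flight dispatcher, and so on) administered by the Korea Transportation Safety Authority (한국교통안전공단), by year of acquisition (2019-2023).
Author: 한국교통안전공단 (Korea Transportation Safety Authority)
URL: https://www.data.go.kr/data/15087792/fileData.do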

Alerts

자격명 is highly overall correlated with 항공기종류 [High correlation]
항공기종류 is highly overall correlated with 자격명 and 1 other field [High correlation]
등급 is highly overall correlated with 항공기종류 [High correlation]
항공기종류 is highly imbalanced (71.6%) [Imbalance]
성별 is highly imbalanced (64.2%) [Imbalance]
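
The two imbalance percentages are consistent with a normalized-entropy score, 1 - H(p) / log2(k), where k is the number of distinct categories. The sketch below assumes that definition (it is inferred from the numbers, not stated in the report) and uses the category counts listed in the variable sections. The eleventh 항공기종류 count of 1 is also an assumption, inferred from "Distinct 11" and the 10000-row total; `imbalance_score` is a hypothetical helper name.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H(p) / log2(k) for a list of category counts.

    0 means perfectly balanced, 1 means all rows fall in one category.
    Assumed definition; not confirmed by the report itself.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)


# Counts for 성별 (two categories).
sex_counts = [9322, 678]
# Counts for 항공기종류; the final 1 is inferred, not listed in the report.
aircraft_counts = [8312, 682, 602, 267, 49, 42, 32, 10, 2, 1, 1]

print(f"{imbalance_score(sex_counts):.1%}")
print(f"{imbalance_score(aircraft_counts):.1%}")
```

Under these assumptions the scores come out at roughly 64.2% and 71.6%, matching the alerts.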

Reproduction

Analysis started: 2024-03-14 20:33:02.608887
Analysis finished: 2024-03-14 20:33:04.730069
Duration: 2.12 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
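
The reported duration can be checked directly from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-03-14 20:33:02.608887")
finished = datetime.fromisoformat("2024-03-14 20:33:04.730069")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints "2.12 seconds"
```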

Variables

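자격명 (license name)
Categorical

HIGH CORRELATION

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
항공정비사 (aircraft maintenance engineer) | 3566
사업용조종사 (commercial pilot) | 3220
자가용조종사 (private pilot) | 1412
운송용조종사 (airline transport pilot) | 922
항공교통관제사 (air traffic controller) | 324
Other values (2) | 556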

Length

Max length: 8
Median length: 6
Mean length: 5.7036
Min length: 5
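
These length statistics can be reproduced from the category counts reported for 자격명, since every row holds one of seven strings. A minimal sketch, assuming length is counted in Unicode code points (one per precomposed Hangul syllable):

```python
from statistics import mean, median

# Category counts for 자격명, copied from this report.
counts = {
    "항공정비사": 3566,
    "사업용조종사": 3220,
    "자가용조종사": 1412,
    "운송용조종사": 922,
    "항공교통관제사": 324,
    "운항관리사": 278,
    "경량항공기조종사": 278,
}

# Expand to one string length per row (10000 rows in total).
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

max_len, min_len = max(lengths), min(lengths)
median_len = median(lengths)
mean_len = mean(lengths)
print(max_len, median_len, mean_len, min_len)
```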

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 항공정비사
2nd row: 운송용조종사
3rd row: 자가용조종사
4th row: 운송용조종사
5th row: 운송용조종사

Common Values

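Value | Count | Frequency (%)
항공정비사 (aircraft maintenance engineer) | 3566 | 35.7%
사업용조종사 (commercial pilot) | 3220 | 32.2%
자가용조종사 (private pilot) | 1412 | 14.1%
운송용조종사 (airline transport pilot) | 922 | 9.2%
항공교통관제사 (air traffic controller) | 324 | 3.2%
운항관리사 (flight dispatcher) | 278 | 2.8%
경량항공기조종사 (light aircraft pilot) | 278 | 2.8%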

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 자격명]

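항공기종류 (aircraft type)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 11
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
비행기 (airplane) | 8312
헬리콥터 (helicopter) | 682
<NA> | 602
조종형비행기 (control-type airplane) | 267
기체 (airframe) | 49
Other values (6) | 88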

Length

Max length: 7
Median length: 3
Mean length: 3.2262
Min length: 2

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 비행기
2nd row: 비행기
3rd row: 비행기
4th row: 비행기
5th row: 비행기

Common Values

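Value | Count | Frequency (%)
비행기 (airplane) | 8312 | 83.1%
헬리콥터 (helicopter) | 682 | 6.8%
<NA> | 602 | 6.0%
조종형비행기 (control-type airplane) | 267 | 2.7%
기체 (airframe) | 49 | 0.5%
전자전기계기 (avionics, electrical, and instruments) | 42 | 0.4%
터빈발동기 (turbine engine) | 32 | 0.3%
경량헬리콥터 (light helicopter) | 10 | 0.1%
활공기 (glider) | 2 | < 0.1%
동력패러슈트 (powered parachute) | 1 | < 0.1%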

Length

[Figure: histogram of lengths of the category]

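등급 (class rating)
Categorical

HIGH CORRELATION

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
<NA> | 4446
육상단발 (single-engine land) | 2856
육상다발 (multi-engine land) | 2688
수상단발 (single-engine sea) | 7
상급 (advanced) | 2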

Length

Max length: 4
Median length: 4
Mean length: 3.9996
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: 육상다발
3rd row: 육상단발
4th row: 육상다발
5th row: 육상다발

Common Values

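Value | Count | Frequency (%)
<NA> | 4446 | 44.5%
육상단발 (single-engine land) | 2856 | 28.6%
육상다발 (multi-engine land) | 2688 | 26.9%
수상단발 (single-engine sea) | 7 | 0.1%
상급 (advanced) | 2 | < 0.1%
수상다발 (multi-engine sea) | 1 | < 0.1%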

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 등급]

취득연월일 (date acquired)
DateTime

Distinct: 1079
Distinct (%): 10.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2019-01-03 00:00:00
Maximum: 2023-12-26 00:00:00
[Figure: histogram with fixed size bins (bins=50)]

개인특정코드 (individual identifier code)
Real number (ℝ)

Distinct: 8677
Distinct (%): 86.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.000152 × 10^9
Minimum: 1.000012 × 10^9
Maximum: 1.0003729 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB
[Figure: histogram of 개인특정코드]

Quantile statistics

Minimum: 1.000012 × 10^9
5-th percentile: 1.0000559 × 10^9
Q1: 1.0001042 × 10^9
Median: 1.0001508 × 10^9
Q3: 1.0001884 × 10^9
95-th percentile: 1.0002816 × 10^9
Maximum: 1.0003729 × 10^9
Range: 360925
Interquartile range (IQR): 84243.75

Descriptive statistics

Standard deviation: 65934.877
Coefficient of variation (CV): 6.5924854 × 10^-5
Kurtosis: 0.34883332
Mean: 1.000152 × 10^9
Median Absolute Deviation (MAD): 42201.5
Skewness: 0.6302706
Sum: 1.000152 × 10^13
Variance: 4.347408 × 10^9
Monotonicity: Not monotonic
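
Several of these figures are simple functions of one another. The sketch below recomputes the range, coefficient of variation, variance, and sum from the reported values. The exact minimum and maximum are taken from the extreme values listed further down in this section; the mean is the rounded value shown in the report, so the results agree only to the displayed precision.

```python
# Values copied from the report for 개인특정코드.
minimum = 1_000_011_982   # smallest listed value
maximum = 1_000_372_907   # largest listed value
mean = 1.000152e9         # rounded as displayed in the report
std = 65934.877
n = 10_000

value_range = maximum - minimum   # Range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum

print(value_range)                # prints 360925
print(f"{cv:.4e} {variance:.4e} {total:.6e}")
```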
[Figure: histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
1000170110 | 4 | < 0.1%
1000060731 | 4 | < 0.1%
1000159279 | 4 | < 0.1%
1000113032 | 4 | < 0.1%
1000140993 | 4 | < 0.1%
1000188481 | 4 | < 0.1%
1000126373 | 3 | < 0.1%
1000083011 | 3 | < 0.1%
1000268305 | 3 | < 0.1%
1000211963 | 3 | < 0.1%
Other values (8667) | 9964 | 99.6%
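
The "Other values" row can be cross-checked against the distinct count and the row total. A short sketch of that bookkeeping, using only numbers shown in this section:

```python
# Counts of the ten most frequent 개인특정코드 values, as listed.
top_counts = [4, 4, 4, 4, 4, 4, 3, 3, 3, 3]
n_rows = 10_000
n_distinct = 8677

# Distinct values not shown individually, and the rows they cover.
other_distinct = n_distinct - len(top_counts)
other_rows = n_rows - sum(top_counts)
other_share = other_rows / n_rows

print(other_distinct, other_rows, f"{other_share:.1%}")  # prints "8667 9964 99.6%"
```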

Minimum 10 values

Value | Count | Frequency (%)
1000011982 | 1 | < 0.1%
1000015575 | 1 | < 0.1%
1000015902 | 1 | < 0.1%
1000020376 | 1 | < 0.1%
1000021096 | 1 | < 0.1%
1000021182 | 1 | < 0.1%
1000021283 | 1 | < 0.1%
1000022058 | 1 | < 0.1%
1000022098 | 1 | < 0.1%
1000022135 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
1000372907 | 1 | < 0.1%
1000372822 | 1 | < 0.1%
1000370408 | 1 | < 0.1%
1000369038 | 1 | < 0.1%
1000368220 | 1 | < 0.1%
1000367680 | 1 | < 0.1%
1000367656 | 1 | < 0.1%
1000367614 | 1 | < 0.1%
1000366293 | 2 | < 0.1%
1000366274 | 1 | < 0.1%

나이대 (age group)
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
20대 (20s) | 5533
30대 (30s) | 2742
40대 (40s) | 763
50대 (50s) | 534
10대 (teens) | 327
Other values (2) | 101

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20대
2nd row: 30대
3rd row: 20대
4th row: 30대
5th row: 50대

Common Values

Value | Count | Frequency (%)
20대 (20s) | 5533 | 55.3%
30대 (30s) | 2742 | 27.4%
40대 (40s) | 763 | 7.6%
50대 (50s) | 534 | 5.3%
10대 (teens) | 327 | 3.3%
60대 (60s) | 97 | 1.0%
70대 (70s) | 4 | < 0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 나이대]

성별 (sex)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
(value not shown) | 9322
(value not shown) | 678

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value | Count | Frequency (%)
(value not shown) | 9322 | 93.2%
(value not shown) | 678 | 6.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 성별]

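주소 (address, by region)
Categorical

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
경기 (Gyeonggi) | 2850
서울 (Seoul) | 2485
인천 (Incheon) | 1044
경남 (Gyeongnam) | 577
부산 (Busan) | 495
Other values (12) | 2549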

Length

Max length: 3
Median length: 2
Mean length: 2.0577
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전북
2nd row: 서울
3rd row: 경기
4th row: 경기
5th row: 서울

Common Values

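Value | Count | Frequency (%)
경기 (Gyeonggi) | 2850 | 28.5%
서울 (Seoul) | 2485 | 24.9%
인천 (Incheon) | 1044 | 10.4%
경남 (Gyeongnam) | 577 | 5.8%
부산 (Busan) | 495 | 5.0%
경북 (Gyeongbuk) | 439 | 4.4%
충남 (Chungnam) | 331 | 3.3%
대구 (Daegu) | 322 | 3.2%
충북 (Chungbuk) | 265 | 2.6%
강원 (Gangwon) | 262 | 2.6%
Other values (7) | 930 | 9.3%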

Length

[Figure: histogram of lengths of the category]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | 자격명 | 항공기종류 | 등급 | 개인특정코드 | 나이대 | 성별 | 주소
자격명 | 1.000 | 0.844 | 0.483 | 0.395 | 0.655 | 0.275 | 0.291
항공기종류 | 0.844 | 1.000 | 0.718 | 0.226 | 0.256 | 0.056 | 0.247
등급 | 0.483 | 0.718 | 1.000 | 0.372 | 0.286 | 0.030 | 0.187
개인특정코드 | 0.395 | 0.226 | 0.372 | 1.000 | 0.625 | 0.188 | 0.290
나이대 | 0.655 | 0.256 | 0.286 | 0.625 | 1.000 | 0.143 | 0.284
성별 | 0.275 | 0.056 | 0.030 | 0.188 | 0.143 | 1.000 | 0.058
주소 | 0.291 | 0.247 | 0.187 | 0.290 | 0.284 | 0.058 | 1.000

[Figure: correlation heatmap]

 | 나이대 | 주소 | 성별 | 등급 | 항공기종류 | 자격명
나이대 | 1.000 | 0.132 | 0.153 | 0.187 | 0.132 | 0.279
주소 | 0.132 | 1.000 | 0.052 | 0.097 | 0.098 | 0.135
성별 | 0.153 | 0.052 | 1.000 | 0.037 | 0.043 | 0.294
등급 | 0.187 | 0.097 | 0.037 | 1.000 | 0.707 | 0.417
항공기종류 | 0.132 | 0.098 | 0.043 | 0.707 | 1.000 | 0.507
자격명 | 0.279 | 0.135 | 0.294 | 0.417 | 0.507 | 1.000

[Figure: correlation heatmap]

 | 개인특정코드 | 자격명 | 항공기종류 | 등급 | 나이대 | 성별 | 주소
개인특정코드 | 1.000 | 0.212 | 0.071 | 0.163 | 0.379 | 0.144 | 0.116
자격명 | 0.212 | 1.000 | 0.507 | 0.417 | 0.279 | 0.294 | 0.135
항공기종류 | 0.071 | 0.507 | 1.000 | 0.707 | 0.132 | 0.043 | 0.098
등급 | 0.163 | 0.417 | 0.707 | 1.000 | 0.187 | 0.037 | 0.097
나이대 | 0.379 | 0.279 | 0.132 | 0.187 | 1.000 | 0.153 | 0.132
성별 | 0.144 | 0.294 | 0.043 | 0.037 | 0.153 | 1.000 | 0.052
주소 | 0.116 | 0.135 | 0.098 | 0.097 | 0.132 | 0.052 | 1.000
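
The report text does not say which association measure each matrix uses. For pairs of categorical columns such as 항공기종류 and 등급, one common choice is Cramér's V, computed from a contingency table of co-occurrence counts. The sketch below implements the plain (uncorrected) statistic on made-up toy tables; `cramers_v` is a hypothetical helper, and profiling tools may apply a bias correction that this version omits, so it is not expected to reproduce the values above.

```python
from math import sqrt


def cramers_v(table):
    """Plain (uncorrected) Cramér's V for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0
    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Toy data (not from the dataset): perfect association vs. independence.
perfect = [[50, 0], [0, 50]]
independent = [[20, 30], [40, 60]]
print(cramers_v(perfect), cramers_v(independent))
```

A value near 1 means one column almost determines the other; a value near 0 means the two are close to independent.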

Missing values

[Figure: nullity bar chart]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

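(index) | 자격명 | 항공기종류 | 등급 | 취득연월일 | 개인특정코드 | 나이대 | 성별 | 주소
8927 | 항공정비사 | 비행기 | <NA> | 2020-11-02 | 1000153166 | 20대 | | 전북
1334 | 운송용조종사 | 비행기 | 육상다발 | 2021-08-16 | 1000061480 | 30대 | | 서울
4113 | 자가용조종사 | 비행기 | 육상단발 | 2019-12-10 | 1000101051 | 20대 | | 경기
2749 | 운송용조종사 | 비행기 | 육상다발 | 2021-09-13 | 1000087812 | 30대 | | 경기
15854 | 운송용조종사 | 비행기 | 육상다발 | 2021-12-17 | 1000260213 | 50대 | | 서울
5213 | 사업용조종사 | 헬리콥터 | 육상단발 | 2019-02-11 | 1000111593 | 30대 | | 경기
16052 | 자가용조종사 | 비행기 | 육상다발 | 2022-06-20 | 1000270173 | 50대 | | 서울
1840 | 사업용조종사 | 비행기 | 육상다발 | 2020-11-25 | 1000070818 | 30대 | | 서울
8670 | 항공정비사 | 비행기 | <NA> | 2021-11-01 | 1000151039 | 20대 | | 경북
14672 | 항공정비사 | 비행기 | <NA> | 2023-01-25 | 1000215264 | 20대 | | 경기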
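
(index) | 자격명 | 항공기종류 | 등급 | 취득연월일 | 개인특정코드 | 나이대 | 성별 | 주소
4465 | 사업용조종사 | 비행기 | 육상단발 | 2019-02-20 | 1000105284 | 20대 | | 경기
6909 | 항공정비사 | 비행기 | <NA> | 2020-03-03 | 1000130032 | 20대 | | 부산
11270 | 운송용조종사 | 비행기 | 육상다발 | 2019-08-29 | 1000171703 | 50대 | | 서울
11538 | 항공정비사 | 비행기 | <NA> | 2021-05-18 | 1000174303 | 20대 | | 인천
5577 | 사업용조종사 | 비행기 | 육상다발 | 2019-01-14 | 1000114367 | 30대 | | 경기
15053 | 경량항공기조종사 | 조종형비행기 | <NA> | 2022-08-03 | 1000225652 | 40대 | | 경기
17025 | 사업용조종사 | 비행기 | 육상단발 | 2023-06-07 | 1000345462 | 20대 | | 부산
15448 | 항공정비사 | 비행기 | <NA> | 2023-11-22 | 1000236513 | 20대 | | 서울
7299 | 사업용조종사 | 비행기 | 육상단발 | 2020-01-14 | 1000136315 | 20대 | | 경기
13344 | 사업용조종사 | 비행기 | 육상다발 | 2022-05-23 | 1000193067 | 30대 | | 전남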