Overview

Dataset statistics

Number of variables5
Number of observations595
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.5 KiB
Average record size in memory42.2 B

Variable types

Numeric2
Categorical3

Dataset

Description본교 학생을 대상으로 취창업센터에서 집계한 문화재수리기능자 자격증 취득 현황입니다. 해당 자료는 구두 조사로 진행되어 일부 데이터가 정확하지 않을 수 있습니다. 컬럼 구성은 "연번", "취득년도", "학과", "학년", "종목"으로 구성되어 있습니다.
URLhttps://www.data.go.kr/data/15105252/fileData.do

Alerts

연번 is highly overall correlated with 취득년도High correlation
취득년도 is highly overall correlated with 연번High correlation
학과 is highly overall correlated with 종목High correlation
종목 is highly overall correlated with 학과High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:18:56.727186
Analysis finished2023-12-12 14:18:57.898969
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct595
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean298
Minimum1
Maximum595
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2023-12-12T23:18:58.004526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile30.7
Q1149.5
median298
Q3446.5
95-th percentile565.3
Maximum595
Range594
Interquartile range (IQR)297

Descriptive statistics

Standard deviation171.90598
Coefficient of variation (CV)0.57686571
Kurtosis-1.2
Mean298
Median Absolute Deviation (MAD)149
Skewness0
Sum177310
Variance29551.667
MonotonicityStrictly increasing
2023-12-12T23:18:58.202472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
393 1
 
0.2%
395 1
 
0.2%
396 1
 
0.2%
397 1
 
0.2%
398 1
 
0.2%
399 1
 
0.2%
400 1
 
0.2%
401 1
 
0.2%
402 1
 
0.2%
Other values (585) 585
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
595 1
0.2%
594 1
0.2%
593 1
0.2%
592 1
0.2%
591 1
0.2%
590 1
0.2%
589 1
0.2%
588 1
0.2%
587 1
0.2%
586 1
0.2%

취득년도
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.1277
Minimum2005
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2023-12-12T23:18:58.403915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2005
5-th percentile2008.7
Q12013
median2019
Q32021.5
95-th percentile2023
Maximum2023
Range18
Interquartile range (IQR)8.5

Descriptive statistics

Standard deviation4.8752609
Coefficient of variation (CV)0.0024169322
Kurtosis-1.0360007
Mean2017.1277
Median Absolute Deviation (MAD)3
Skewness-0.50224167
Sum1200191
Variance23.768169
MonotonicityIncreasing
2023-12-12T23:18:58.567020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
2023 78
13.1%
2022 71
11.9%
2019 53
 
8.9%
2021 50
 
8.4%
2020 49
 
8.2%
2013 35
 
5.9%
2018 34
 
5.7%
2015 32
 
5.4%
2009 27
 
4.5%
2011 26
 
4.4%
Other values (9) 140
23.5%
ValueCountFrequency (%)
2005 1
 
0.2%
2006 2
 
0.3%
2007 6
 
1.0%
2008 21
3.5%
2009 27
4.5%
2010 25
4.2%
2011 26
4.4%
2012 24
4.0%
2013 35
5.9%
2014 19
3.2%
ValueCountFrequency (%)
2023 78
13.1%
2022 71
11.9%
2021 50
8.4%
2020 49
8.2%
2019 53
8.9%
2018 34
5.7%
2017 19
 
3.2%
2016 23
 
3.9%
2015 32
5.4%
2014 19
 
3.2%

학과
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
보존
247 
미공
150 
건축
135 
수리기술
40 
조경
 
7
Other values (6)
 
16

Length

Max length5
Median length2
Mean length2.1915966
Min length2

Unique

Unique3 ?
Unique (%)0.5%

Sample

1st row건축
2nd row건축
3rd row건축
4th row건축
5th row건축

Common Values

ValueCountFrequency (%)
보존 247
41.5%
미공 150
25.2%
건축 135
22.7%
수리기술 40
 
6.7%
조경 7
 
1.2%
건축(원) 5
 
0.8%
미공(원) 5
 
0.8%
무형 3
 
0.5%
융고 1
 
0.2%
유산융합 1
 
0.2%

Length

2023-12-12T23:18:58.742593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보존 247
41.5%
미공 150
25.2%
건축 135
22.7%
수리기술 40
 
6.7%
조경 7
 
1.2%
건축(원 5
 
0.8%
미공(원 5
 
0.8%
무형 3
 
0.5%
융고 1
 
0.2%
유산융합 1
 
0.2%

학년
Categorical

Distinct7
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
4
203 
3
159 
2
84 
졸업
81 
석사
60 
Other values (2)
 
8

Length

Max length2
Median length1
Mean length1.2386555
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row4
2nd row4
3rd row3
4th row4
5th row4

Common Values

ValueCountFrequency (%)
4 203
34.1%
3 159
26.7%
2 84
14.1%
졸업 81
 
13.6%
석사 60
 
10.1%
1 7
 
1.2%
박사 1
 
0.2%

Length

2023-12-12T23:18:58.923278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:18:59.061083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 203
34.1%
3 159
26.7%
2 84
14.1%
졸업 81
 
13.6%
석사 60
 
10.1%
1 7
 
1.2%
박사 1
 
0.2%

종목
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
보존처리공
200 
실측설계사보
137 
세척공
69 
화공
64 
칠공
36 
Other values (11)
89 

Length

Max length6
Median length5
Mean length4.2487395
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row실측설계사보
2nd row실측설계사보
3rd row실측설계사보
4th row실측설계사보
5th row실측설계사보

Common Values

ValueCountFrequency (%)
보존처리공 200
33.6%
실측설계사보 137
23.0%
세척공 69
 
11.6%
화공 64
 
10.8%
칠공 36
 
6.1%
제작와공 20
 
3.4%
모사공 15
 
2.5%
도금공 11
 
1.8%
철물공 9
 
1.5%
목조각공 8
 
1.3%
Other values (6) 26
 
4.4%

Length

2023-12-12T23:18:59.205989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보존처리공 200
33.6%
실측설계사보 137
23.0%
세척공 69
 
11.6%
화공 64
 
10.8%
칠공 36
 
6.1%
제작와공 20
 
3.4%
모사공 15
 
2.5%
도금공 11
 
1.8%
철물공 9
 
1.5%
목조각공 8
 
1.3%
Other values (6) 26
 
4.4%

Interactions

2023-12-12T23:18:57.355037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:56.994652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:57.512515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:57.164095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:18:59.289647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번취득년도학과학년종목
연번1.0000.9700.4610.4250.441
취득년도0.9701.0000.3010.3960.295
학과0.4610.3011.0000.5570.859
학년0.4250.3960.5571.0000.451
종목0.4410.2950.8590.4511.000
2023-12-12T23:18:59.386406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학과종목학년
학과1.0000.5470.315
종목0.5471.0000.223
학년0.3150.2231.000
2023-12-12T23:18:59.467933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번취득년도학과학년종목
연번1.0000.9970.2160.2310.188
취득년도0.9971.0000.1350.2130.122
학과0.2160.1351.0000.3150.547
학년0.2310.2130.3151.0000.223
종목0.1880.1220.5470.2231.000

Missing values

2023-12-12T23:18:57.714357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:18:57.841068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번취득년도학과학년종목
012005건축4실측설계사보
122006건축4실측설계사보
232006건축3실측설계사보
342007건축4실측설계사보
452007건축4실측설계사보
562007건축졸업실측설계사보
672007보존4보존처리공
782007보존졸업보존처리공
892007보존졸업보존처리공
9102008건축졸업실측설계사보
연번취득년도학과학년종목
5855862023건축(원)석사실측설계사보
5865872023건축(원)석사실측설계사보
5875882023건축(원)석사실측설계사보
5885892023미공(원)석사모사공
5895902023미공(원)석사칠공
5905912023미공(원)석사칠공
5915922023미공(원)석사칠공
5925932023미공(원)석사화공
5935942023정원문화석사조경공
5945952023수리기술석사보존처리공