Overview

Dataset statistics

Number of variables4
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory39.0 B

Variable types

Categorical2
Numeric2

Dataset

Description사립학교교직원연금공단 월별 홈페이지 가입자 현황과 관련된 데이터로 연도, 월, 가입자, 신규가입자 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15065011/fileData.do

Alerts

가입인원수 is highly overall correlated with 기준연도High correlation
기준연도 is highly overall correlated with 가입인원수High correlation
가입인원수 has unique valuesUnique
신규가입인원수 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:28:39.012559
Analysis finished2023-12-12 03:28:39.739529
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)12.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
2021
12 
2022
12 
2023
2020

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2021

Common Values

ValueCountFrequency (%)
2021 12
36.4%
2022 12
36.4%
2023 5
15.2%
2020 4
 
12.1%

Length

2023-12-12T12:28:39.852530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:28:39.993372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 12
36.4%
2022 12
36.4%
2023 5
15.2%
2020 4
 
12.1%

기준월
Categorical

Distinct12
Distinct (%)36.4%
Missing0
Missing (%)0.0%
Memory size396.0 B
="09"
="10"
="11"
="12"
="01"
Other values (7)
18 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row="09"
2nd row="10"
3rd row="11"
4th row="12"
5th row="01"

Common Values

ValueCountFrequency (%)
="09" 3
9.1%
="10" 3
9.1%
="11" 3
9.1%
="12" 3
9.1%
="01" 3
9.1%
="02" 3
9.1%
="03" 3
9.1%
="04" 3
9.1%
="05" 3
9.1%
="06" 2
6.1%
Other values (2) 4
12.1%

Length

2023-12-12T12:28:40.495445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
09 3
9.1%
10 3
9.1%
11 3
9.1%
12 3
9.1%
01 3
9.1%
02 3
9.1%
03 3
9.1%
04 3
9.1%
05 3
9.1%
06 2
6.1%
Other values (2) 4
12.1%

가입인원수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean502208.21
Minimum459962
Maximum548423
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-12T12:28:40.664222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum459962
5-th percentile462872
Q1480012
median500025
Q3524491
95-th percentile545067.8
Maximum548423
Range88461
Interquartile range (IQR)44479

Descriptive statistics

Standard deviation27163.986
Coefficient of variation (CV)0.054089092
Kurtosis-1.20686
Mean502208.21
Median Absolute Deviation (MAD)22040
Skewness0.096492836
Sum16572871
Variance7.3788214 × 108
MonotonicityStrictly increasing
2023-12-12T12:28:40.841234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
459962 1
 
3.0%
527061 1
 
3.0%
511943 1
 
3.0%
514692 1
 
3.0%
516947 1
 
3.0%
519553 1
 
3.0%
522065 1
 
3.0%
524491 1
 
3.0%
529400 1
 
3.0%
461699 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
459962 1
3.0%
461699 1
3.0%
463654 1
3.0%
465805 1
3.0%
468657 1
3.0%
471224 1
3.0%
475428 1
3.0%
478072 1
3.0%
480012 1
3.0%
482941 1
3.0%
ValueCountFrequency (%)
548423 1
3.0%
547235 1
3.0%
543623 1
3.0%
538600 1
3.0%
535301 1
3.0%
531700 1
3.0%
529400 1
3.0%
527061 1
3.0%
524491 1
3.0%
522065 1
3.0%

신규가입인원수
Real number (ℝ)

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2709.3333
Minimum1188
Maximum5023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-12T12:28:41.019219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1188
5-th percentile1778.4
Q12151
median2512
Q33103
95-th percentile4458
Maximum5023
Range3835
Interquartile range (IQR)952

Descriptive statistics

Standard deviation885.80679
Coefficient of variation (CV)0.3269464
Kurtosis0.84540177
Mean2709.3333
Median Absolute Deviation (MAD)382
Skewness1.0562626
Sum89408
Variance784653.67
MonotonicityNot monotonic
2023-12-12T12:28:41.204740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1920 1
 
3.0%
2570 1
 
3.0%
3976 1
 
3.0%
2749 1
 
3.0%
2255 1
 
3.0%
2606 1
 
3.0%
2512 1
 
3.0%
2426 1
 
3.0%
2339 1
 
3.0%
1737 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1188 1
3.0%
1737 1
3.0%
1806 1
3.0%
1920 1
3.0%
1940 1
3.0%
1955 1
3.0%
1959 1
3.0%
2130 1
3.0%
2151 1
3.0%
2171 1
3.0%
ValueCountFrequency (%)
5023 1
3.0%
4839 1
3.0%
4204 1
3.0%
3976 1
3.0%
3647 1
3.0%
3612 1
3.0%
3601 1
3.0%
3299 1
3.0%
3103 1
3.0%
2852 1
3.0%

Interactions

2023-12-12T12:28:39.359684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:28:39.162529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:28:39.475097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:28:39.240485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:28:41.333969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도기준월가입인원수신규가입인원수
기준연도1.0000.0000.9350.801
기준월0.0001.0000.0000.628
가입인원수0.9350.0001.0000.534
신규가입인원수0.8010.6280.5341.000
2023-12-12T12:28:41.457648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준월기준연도
기준월1.0000.000
기준연도0.0001.000
2023-12-12T12:28:41.584312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가입인원수신규가입인원수기준연도기준월
가입인원수1.0000.3750.7650.000
신규가입인원수0.3751.0000.4190.253
기준연도0.7650.4191.0000.000
기준월0.0000.2530.0001.000

Missing values

2023-12-12T12:28:39.605787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:28:39.698822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연도기준월가입인원수신규가입인원수
02020="09"4599621920
12020="10"4616991737
22020="11"4636541955
32020="12"4658052151
42021="01"4686572852
52021="02"4712242567
62021="03"4754284204
72021="04"4780722644
82021="05"4800121940
92021="06"4829411959
기준연도기준월가입인원수신규가입인원수
232022="08"5220652512
242022="09"5244912426
252022="10"5270612570
262022="11"5294002339
272022="12"5317002300
282023="01"5353013601
292023="02"5386003299
302023="03"5436235023
312023="04"5472353612
322023="05"5484231188