Overview

Dataset statistics

Number of variables: 5
Number of observations: 552
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 56
Duplicate rows (%): 10.1%
Total size in memory: 21.7 KiB
Average record size in memory: 40.2 B

Variable types

Categorical: 1
DateTime: 2
Boolean: 2

Dataset

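Description: Table from the research management area of the Korea Institute of Machinery and Materials (한국기계연구원) that manages the monthly details of dispatched researchers in project/task plans. It tracks the dispatched person (파견자), dispatch effective date (파견적용일), internal absorption flag (내부흡수여부), reserve fund flag (준비금여부), and related fields.
URL: https://www.data.go.kr/data/15078067/fileData.do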

Alerts

준비금여부 has constant value "" (Constant)
작성일 has constant value "" (Constant)
Dataset has 56 (10.1%) duplicate rows (Duplicates)
파견자 is highly overall correlated with 내부흡수여부 (High correlation)
내부흡수여부 is highly overall correlated with 파견자 (High correlation)
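
A Constant alert means a column holds exactly one distinct value. The check can be sketched in plain Python as below. The rows are copied from the Sample section of this report, and `constant_columns` is a hypothetical helper name used for illustration, not part of ydata-profiling.

```python
COLUMNS = ["파견자", "파견적용일", "내부흡수여부", "준비금여부", "작성일"]

# Rows 4 to 6 of the Sample table in this report.
ROWS = [
    ("*치*", "2019-06-01", "Y", "N", "2023-07-28"),
    ("*치*", "2019-07-01", "N", "N", "2023-07-28"),
    ("*민*", "2019-03-01", "N", "N", "2023-07-28"),
]


def constant_columns(columns, rows):
    """Return the names of columns whose values are identical in every row."""
    constant = []
    for index, name in enumerate(columns):
        distinct = {row[index] for row in rows}
        if len(distinct) == 1:
            constant.append(name)
    return constant


print(constant_columns(COLUMNS, ROWS))
```

On these three rows the helper flags 준비금여부 and 작성일, the same two columns the report marks as constant over all 552 observations.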

Reproduction

Analysis started: 2023-12-12 05:47:50.338157
Analysis finished: 2023-12-12 05:47:50.669690
Duration: 0.33 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
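
The reported duration follows from the two timestamps above. A minimal sketch using only the Python standard library, assuming both timestamps are in the same time zone (the report does not state which one):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction section.
started = datetime.strptime("2023-12-12 05:47:50.338157", FMT)
finished = datetime.strptime("2023-12-12 05:47:50.669690", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```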

Variables

파견자
Categorical

HIGH CORRELATION 

Distinct: 23
Distinct (%): 4.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

Value              Count
*승*               68
*진*               49
*정*               43
*동*               42
*상*               42
Other values (18)  308

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: *치*
2nd row: *치*
3rd row: *치*
4th row: *치*
5th row: *치*

Common Values

Value              Count  Frequency (%)
*승*               68     12.3%
*진*               49     8.9%
*정*               43     7.8%
*동*               42     7.6%
*상*               42     7.6%
*용*               36     6.5%
*민*               36     6.5%
*아*               34     6.2%
*성*               34     6.2%
*유*               34     6.2%
Other values (13)  134    24.3%
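
The Frequency (%) column is each count divided by the 552 observations, and the "Other values" row is whatever remains after the ten listed names. A short sketch of that arithmetic, with the counts copied from the table above:

```python
TOTAL_ROWS = 552  # "Number of observations" in this report

# Counts of the ten most frequent 파견자 values.
counts = {
    "*승*": 68, "*진*": 49, "*정*": 43, "*동*": 42, "*상*": 42,
    "*용*": 36, "*민*": 36, "*아*": 34, "*성*": 34, "*유*": 34,
}

# Rows not covered by the ten listed values.
other = TOTAL_ROWS - sum(counts.values())

# Share of all rows per value, as a percentage rounded to one decimal.
frequencies = {value: round(100 * n / TOTAL_ROWS, 1) for value, n in counts.items()}
other_frequency = round(100 * other / TOTAL_ROWS, 1)

for value, share in frequencies.items():
    print(f"{value}\t{counts[value]}\t{share}%")
print(f"Other values\t{other}\t{other_frequency}%")
```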

Length

[Figure: Histogram of lengths of the category]

파견적용일
Date

Distinct: 61
Distinct (%): 11.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB
Minimum: 2019-01-01 00:00:00
Maximum: 2024-01-01 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

내부흡수여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 684.0 B

Value  Count  Frequency (%)
False  466    84.4%
True   86     15.6%

준비금여부
Boolean

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 684.0 B

Value  Count  Frequency (%)
False  552    100.0%

작성일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB
Minimum: 2023-07-28 00:00:00
Maximum: 2023-07-28 00:00:00

[Figure: Histogram with fixed size bins (bins=1)]

Correlations

              파견자  파견적용일  내부흡수여부
파견자        1.000   0.000       0.839
파견적용일    0.000   1.000       0.000
내부흡수여부  0.839   0.000       1.000

              파견자  내부흡수여부
파견자        1.000   0.752
내부흡수여부  0.752   1.000

              파견자  내부흡수여부
파견자        1.000   0.752
내부흡수여부  0.752   1.000
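
The report does not name the association measure behind each matrix. As an illustration only, Cramér's V is one standard measure of association between two categorical columns such as 파견자 and 내부흡수여부. The sketch below computes it without bias correction from (파견자, 내부흡수여부) pairs taken from the first ten rows of the Sample section. Because it uses ten rows rather than all 552, it is not expected to reproduce the 0.839 or 0.752 values above, and `cramers_v` is a hypothetical helper name.

```python
import math
from collections import Counter

# (파견자, 내부흡수여부) pairs from the first ten rows of the Sample table.
pairs = [("*치*", "Y")] * 5 + [("*치*", "N")] + [("*민*", "N")] * 4


def cramers_v(pairs):
    """Cramér's V (no bias correction) for a list of (a, b) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(a for a, _ in pairs)
    col_totals = Counter(b for _, b in pairs)
    k = min(len(row_totals), len(col_totals)) - 1
    if k < 1:
        raise ValueError("each column needs at least two distinct values")
    # Pearson chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            chi2 += (joint[(a, b)] - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


print(round(cramers_v(pairs), 3))
```

A value near 1 means that knowing the 파견자 value almost determines 내부흡수여부, which is the kind of relationship the High correlation alert points at.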

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly spot patterns in data completion.]

Sample

     파견자  파견적용일  내부흡수여부  준비금여부  작성일
0    *치*    2019-02-01  Y             N           2023-07-28
1    *치*    2019-03-01  Y             N           2023-07-28
2    *치*    2019-04-01  Y             N           2023-07-28
3    *치*    2019-05-01  Y             N           2023-07-28
4    *치*    2019-06-01  Y             N           2023-07-28
5    *치*    2019-07-01  N             N           2023-07-28
6    *민*    2019-03-01  N             N           2023-07-28
7    *민*    2019-04-01  N             N           2023-07-28
8    *민*    2019-05-01  N             N           2023-07-28
9    *민*    2019-06-01  N             N           2023-07-28

     파견자  파견적용일  내부흡수여부  준비금여부  작성일
542  *수*    2022-02-01  N             N           2023-07-28
543  *수*    2022-03-01  N             N           2023-07-28
544  *수*    2022-04-01  N             N           2023-07-28
545  *수*    2022-05-01  N             N           2023-07-28
546  *수*    2022-06-01  N             N           2023-07-28
547  *수*    2022-07-01  N             N           2023-07-28
548  *수*    2022-08-01  N             N           2023-07-28
549  *수*    2022-09-01  N             N           2023-07-28
550  *수*    2022-10-01  N             N           2023-07-28
551  *수*    2022-11-01  N             N           2023-07-28

Duplicate rows

Most frequently occurring

   파견자  파견적용일  내부흡수여부  준비금여부  작성일      # duplicates
0  *동*    2020-11-01  N             N           2023-07-28  2
1  *동*    2020-12-01  N             N           2023-07-28  2
2  *동*    2021-01-01  N             N           2023-07-28  2
3  *동*    2021-02-01  N             N           2023-07-28  2
4  *민*    2019-10-01  N             N           2023-07-28  2
5  *민*    2019-11-01  N             N           2023-07-28  2
6  *상*    2020-01-01  N             N           2023-07-28  2
7  *상*    2020-02-01  N             N           2023-07-28  2
8  *상*    2020-03-01  N             N           2023-07-28  2
9  *승*    2020-02-01  N             N           2023-07-28  2
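
A row counts as a duplicate when all five column values match another row. One way to find such rows is sketched below with the standard library. The list is a small illustrative subset assembled from rows shown in this report, not the full dataset, and the report's own counting rule for its "56 duplicate rows" figure is not stated here.

```python
from collections import Counter

# Two rows from the duplicate table (each listed there with 2 occurrences)
# plus one row from the Sample table.
rows = [
    ("*동*", "2020-11-01", "N", "N", "2023-07-28"),
    ("*동*", "2020-11-01", "N", "N", "2023-07-28"),
    ("*동*", "2020-12-01", "N", "N", "2023-07-28"),
    ("*동*", "2020-12-01", "N", "N", "2023-07-28"),
    ("*치*", "2019-02-01", "Y", "N", "2023-07-28"),
]

# Count how often each complete row occurs, then keep the repeated ones.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicates.items():
    print(*row, n)
```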