Overview

Dataset statistics

Number of variables: 5
Number of observations: 1957
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 78.5 KiB
Average record size in memory: 41.1 B

Variable types

Numeric: 1
DateTime: 2
Boolean: 1
Text: 1

Dataset

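Description: Learning start date, learning end date, and completion status of learners enrolled in e-learning content on the National Marine Environment Online Platform (국가해양환경온라인 플랫폼), 2022-2023.
Author: Ministry of Oceans and Fisheries (해양수산부)
URL: https://www.data.go.kr/data/15127323/fileData.do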

Alerts

입과번호(ENRL_NO) (enrollment number) has unique values [Unique]

Reproduction

Analysis started: 2024-04-06 08:38:25.077792
Analysis finished: 2024-04-06 08:38:26.294302
Duration: 1.22 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
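
A report of this kind can be regenerated with a few lines of Python. The sketch below is a minimal example, not the configuration actually used for this report: the function name `build_report`, the file paths passed to it, the `cp949` default encoding, and the report title are all assumptions, and the settings stored in config.json are not reproduced.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV export of the dataset and write the report as HTML.

    The encoding default is an assumption; adjust it to match the downloaded file.
    """
    # Imported lazily so that defining the function does not require the libraries.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(
        csv_path,
        encoding=encoding,
        # Column names as they appear in this report; parsed so they profile as dates.
        parse_dates=["학습시작일(LRNG_BGNG_DT)", "학습종료일(LRNG_END_DT)"],
    )
    report = ProfileReport(df, title="E-learning enrollment profile")  # title is illustrative
    report.to_file(html_path)
```

Calling `build_report("enrollments.csv", "report.html")` (both paths hypothetical) would write a comparable HTML report, provided pandas and ydata-profiling are installed.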

Variables

입과번호(ENRL_NO) (enrollment number)
Real number (ℝ)

UNIQUE

Distinct: 1957
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7512.8733
Minimum: 3884
Maximum: 10812
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.3 KiB

Quantile statistics

Minimum: 3884
5-th percentile: 5284.8
Q1: 5834
Median: 7831
Q3: 8602
95-th percentile: 9229.4
Maximum: 10812
Range: 6928
Interquartile range (IQR): 2768

Descriptive statistics

Standard deviation: 1481.7091
Coefficient of variation (CV): 0.19722269
Kurtosis: -0.38711047
Mean: 7512.8733
Median Absolute Deviation (MAD): 875
Skewness: -0.47152143
Sum: 14702693
Variance: 2195461.9
Monotonicity: Strictly increasing
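
Several of the figures above are tied together by simple identities (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, range = maximum - minimum, IQR = Q3 - Q1). The following sketch re-derives them from the reported values as a consistency check; the tolerance is an assumption chosen to absorb the rounding in the printed numbers.

```python
import math

# Values copied from the report for 입과번호(ENRL_NO).
n = 1957
total = 14702693
mean = 7512.8733
std = 1481.7091
variance = 2195461.9
cv = 0.19722269
minimum, maximum = 3884, 10812
q1, q3 = 5834, 8602

REL_TOL = 1e-6  # assumed tolerance for values printed with about 8 significant digits

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=REL_TOL),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=REL_TOL),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=REL_TOL),
    "range = max - min": maximum - minimum == 6928,
    "iqr = q3 - q1": q3 - q1 == 2768,
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```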
Histogram with fixed size bins (bins=50)
Common values
Value                Count  Frequency (%)
3884                 1      0.1%
8294                 1      0.1%
8289                 1      0.1%
8288                 1      0.1%
8287                 1      0.1%
8286                 1      0.1%
8283                 1      0.1%
8281                 1      0.1%
8280                 1      0.1%
8277                 1      0.1%
Other values (1947)  1947   99.5%

Minimum 10 values
Value  Count  Frequency (%)
3884   1      0.1%
3885   1      0.1%
3886   1      0.1%
3887   1      0.1%
3888   1      0.1%
3889   1      0.1%
3890   1      0.1%
3891   1      0.1%
3892   1      0.1%
3893   1      0.1%

Maximum 10 values
Value  Count  Frequency (%)
10812  1      0.1%
10806  1      0.1%
10804  1      0.1%
10795  1      0.1%
10794  1      0.1%
10791  1      0.1%
10787  1      0.1%
10779  1      0.1%
10774  1      0.1%
10769  1      0.1%

학습시작일(LRNG_BGNG_DT) (learning start date)
DateTime

Distinct: 19
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
Minimum: 2023-01-01 00:00:00
Maximum: 2024-03-14 00:00:00
Histogram with fixed size bins (bins=19)

학습종료일(LRNG_END_DT) (learning end date)
DateTime

Distinct: 15
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
Minimum: 2023-06-30 00:00:00
Maximum: 2024-12-31 00:00:00
Histogram with fixed size bins (bins=15)

이수여부(CMPTN_YN) (completion status)
Boolean

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Value  Count  Frequency (%)
True   1652   84.4%
False  305    15.6%

수료번호(CRTF_NO) (certificate number)
Text

Distinct: 1653
Distinct (%): 84.5%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB

Length

Max length: 10
Median length: 8
Mean length: 7.9770056
Min length: 3

Characters and Unicode

Total characters: 15611
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
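
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to tally general categories for three values taken from this report. The helper name `category_counts` is hypothetical; the codes `Nd`, `Pd`, and `Lo` are the Unicode abbreviations for Decimal Number, Dash Punctuation, and Other Letter.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in the given strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Three certificate-number values that appear in the report.
sample = ["23-03884", "23-특-08197", "미수료"]
counts = category_counts(sample)

for code, count in counts.most_common():
    print(code, count)
```

Running the same tally over the whole 수료번호(CRTF_NO) column would be one way to arrive at a category breakdown like the one the report lists.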

Unique

Unique: 1652
Unique (%): 84.4%

Sample

1st row: 23-03884
2nd row: 23-03885
3rd row: 23-03886
4th row: 23-03887
5th row: 미수료 (not completed)

Value                   Count  Frequency (%)
미수료 (not completed)  305    15.6%
23-특-08197             1      0.1%
23-특-08180             1      0.1%
23-08195                1      0.1%
23-특-08194             1      0.1%
23-특-08193             1      0.1%
23-특-08189             1      0.1%
23-특-08188             1      0.1%
23-특-08187             1      0.1%
23-특-08186             1      0.1%
Other values (1643)     1643   84.0%

Most occurring characters

Value             Count  Frequency (%)
-                 2392   15.3%
2                 2165   13.9%
0                 2111   13.5%
3                 2057   13.2%
8                 1012   6.5%
7                 998    6.4%
5                 969    6.2%
특                740    4.7%
4                 612    3.9%
9                 593    3.8%
Other values (5)  1962   12.6%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    11564  74.1%
Dash Punctuation  2392   15.3%
Other Letter      1655   10.6%
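
The category totals can be cross-checked against the value counts reported for 수료번호(CRTF_NO). The sketch below is a worked consistency check under stated assumptions: every non-completed row holds the 3-character placeholder 미수료, the 740 occurrences of the most frequent Hangul character correspond to 740 ten-character values of the form 23-특-08197, and all remaining completed rows hold eight-character values of the form 23-03884.

```python
import math

n_rows = 1957
not_completed = 305   # rows holding the placeholder 미수료 (3 Hangul characters)
completed = 1652      # rows flagged True for 이수여부(CMPTN_YN)
special = 740         # ASSUMPTION: one 특 per "23-특-08197"-style value (10 characters)
regular = completed - special  # ASSUMPTION: "23-03884"-style values (8 characters)

total_chars = not_completed * 3 + special * 10 + regular * 8
dashes = special * 2 + regular * 1
hangul = not_completed * 3 + special * 1
digits = total_chars - dashes - hangul
mean_length = total_chars / n_rows

print(total_chars, dashes, hangul, digits, round(mean_length, 7))
```

Under these assumptions the derived totals agree with the figures in the report (15611 characters, 2392 dashes, 1655 Hangul letters, 11564 digits, mean length 7.9770056).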

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
2      2165   18.7%
0      2111   18.3%
3      2057   17.8%
8      1012   8.8%
7      998    8.6%
5      969    8.4%
4      612    5.3%
9      593    5.1%
6      557    4.8%
1      490    4.2%

Other Letter
Value  Count  Frequency (%)
특     740    44.7%
미     305    18.4%
수     305    18.4%
료     305    18.4%

Dash Punctuation
Value  Count  Frequency (%)
-      2392   100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13956  89.4%
Hangul  1655   10.6%

Most frequent character per script

Common
Value  Count  Frequency (%)
-      2392   17.1%
2      2165   15.5%
0      2111   15.1%
3      2057   14.7%
8      1012   7.3%
7      998    7.2%
5      969    6.9%
4      612    4.4%
9      593    4.2%
6      557    4.0%

Hangul
Value  Count  Frequency (%)
특     740    44.7%
미     305    18.4%
수     305    18.4%
료     305    18.4%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   13956  89.4%
Hangul  1655   10.6%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
-      2392   17.1%
2      2165   15.5%
0      2111   15.1%
3      2057   14.7%
8      1012   7.3%
7      998    7.2%
5      969    6.9%
4      612    4.4%
9      593    4.2%
6      557    4.0%

Hangul
Value  Count  Frequency (%)
특     740    44.7%
미     305    18.4%
수     305    18.4%
료     305    18.4%

Interactions


Correlations

                          입과번호(ENRL_NO)  학습시작일(LRNG_BGNG_DT)  학습종료일(LRNG_END_DT)  이수여부(CMPTN_YN)
입과번호(ENRL_NO)         1.000              0.852                     0.811                    0.355
학습시작일(LRNG_BGNG_DT)  0.852              1.000                     1.000                    0.381
학습종료일(LRNG_END_DT)   0.811              1.000                     1.000                    0.348
이수여부(CMPTN_YN)        0.355              0.381                     0.348                    1.000

                    입과번호(ENRL_NO)  이수여부(CMPTN_YN)
입과번호(ENRL_NO)   1.000              0.354
이수여부(CMPTN_YN)  0.354              1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
      입과번호(ENRL_NO)  학습시작일(LRNG_BGNG_DT)  학습종료일(LRNG_END_DT)  이수여부(CMPTN_YN)  수료번호(CRTF_NO)
0     3884               2023-01-01                2023-12-31               Y                   23-03884
1     3885               2023-01-01                2023-12-31               Y                   23-03885
2     3886               2023-01-01                2023-12-31               Y                   23-03886
3     3887               2023-01-01                2023-12-31               Y                   23-03887
4     3888               2023-01-01                2023-12-31               N                   미수료
5     3889               2023-01-01                2023-12-31               Y                   23-03889
6     3890               2023-01-01                2023-12-31               N                   미수료
7     3891               2023-01-01                2023-12-31               Y                   23-03891
8     3892               2023-01-01                2023-12-31               Y                   23-03892
9     3893               2023-01-01                2023-12-31               Y                   23-03893

Last rows
      입과번호(ENRL_NO)  학습시작일(LRNG_BGNG_DT)  학습종료일(LRNG_END_DT)  이수여부(CMPTN_YN)  수료번호(CRTF_NO)
1947  10769              2024-01-01                2024-12-31               Y                   24-10769
1948  10774              2024-03-12                2024-12-31               Y                   24-10774
1949  10779              2024-03-12                2024-12-31               Y                   24-10779
1950  10787              2024-01-03                2024-12-31               Y                   24-10787
1951  10791              2024-01-01                2024-12-31               Y                   24-10791
1952  10794              2024-03-12                2024-12-31               Y                   24-10794
1953  10795              2024-01-01                2024-12-31               Y                   24-10795
1954  10804              2024-01-01                2024-12-31               Y                   24-10804
1955  10806              2024-01-03                2024-12-31               Y                   24-10806
1956  10812              2024-01-01                2024-12-31               Y                   24-10812
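
In the sample rows shown here, the certificate number of a completed enrollment looks like the two-digit start year followed by the zero-padded enrollment number, while non-completed rows carry the placeholder 미수료. The sketch below tests that reading against a few of the displayed rows. It is an observation about this sample only, not a documented rule: the function name `expected_certificate` is hypothetical, and values such as 23-특-08197 elsewhere in the report do not follow the pattern.

```python
from datetime import date

NOT_COMPLETED = "미수료"  # placeholder used for rows that were not completed

# (enrollment number, learning start date, completion flag, certificate number)
sample_rows = [
    (3884, date(2023, 1, 1), "Y", "23-03884"),
    (3888, date(2023, 1, 1), "N", "미수료"),
    (10774, date(2024, 3, 12), "Y", "24-10774"),
    (10812, date(2024, 1, 1), "Y", "24-10812"),
]


def expected_certificate(enrl_no: int, start: date, completed: str) -> str:
    """Guess the certificate number from the pattern seen in the sample rows."""
    if completed != "Y":
        return NOT_COMPLETED
    return f"{start.year % 100:02d}-{enrl_no:05d}"


mismatches = [
    row for row in sample_rows
    if expected_certificate(row[0], row[1], row[2]) != row[3]
]
print("rows that break the assumed pattern:", mismatches)
```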