Overview

Dataset statistics

Number of variables4
Number of observations130
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory36.0 B

Variable types

Text1
Numeric3

Dataset

Description한국해양수산연수원의 선원교육업무와 관련된 교육예약시스템의 회원가입 통계 현황에 대한 정보 기간 : 2012년 3월 ~ 2022년 12월
URLhttps://www.data.go.kr/data/15121738/fileData.do

Alerts

합계 is highly overall correlated with 일반회원High correlation
일반회원 is highly overall correlated with 합계High correlation
회원가입 연월 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:10:04.689740
Analysis finished2023-12-12 14:10:05.890115
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

회원가입 연월
Text

UNIQUE 

Distinct130
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T23:10:06.012316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length8.2538462
Min length8

Characters and Unicode

Total characters1073
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique130 ?
Unique (%)100.0%

Sample

1st row2022년 12월
2nd row2022년 11월
3rd row2022년 10월
4th row2022년 9월
5th row2022년 8월
ValueCountFrequency (%)
2022년 12
 
4.6%
2015년 12
 
4.6%
2018년 12
 
4.6%
2019년 12
 
4.6%
2014년 12
 
4.6%
2017년 12
 
4.6%
2021년 12
 
4.6%
2013년 12
 
4.6%
2020년 12
 
4.6%
2016년 12
 
4.6%
Other values (13) 140
53.8%
2023-12-12T23:10:06.311468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 209
19.5%
1 160
14.9%
0 153
14.3%
130
12.1%
130
12.1%
130
12.1%
5 23
 
2.1%
9 23
 
2.1%
8 23
 
2.1%
7 23
 
2.1%
Other values (3) 69
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 683
63.7%
Other Letter 260
 
24.2%
Space Separator 130
 
12.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 209
30.6%
1 160
23.4%
0 153
22.4%
5 23
 
3.4%
9 23
 
3.4%
8 23
 
3.4%
7 23
 
3.4%
6 23
 
3.4%
4 23
 
3.4%
3 23
 
3.4%
Other Letter
ValueCountFrequency (%)
130
50.0%
130
50.0%
Space Separator
ValueCountFrequency (%)
130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 813
75.8%
Hangul 260
 
24.2%

Most frequent character per script

Common
ValueCountFrequency (%)
2 209
25.7%
1 160
19.7%
0 153
18.8%
130
16.0%
5 23
 
2.8%
9 23
 
2.8%
8 23
 
2.8%
7 23
 
2.8%
6 23
 
2.8%
4 23
 
2.8%
Hangul
ValueCountFrequency (%)
130
50.0%
130
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 813
75.8%
Hangul 260
 
24.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 209
25.7%
1 160
19.7%
0 153
18.8%
130
16.0%
5 23
 
2.8%
9 23
 
2.8%
8 23
 
2.8%
7 23
 
2.8%
6 23
 
2.8%
4 23
 
2.8%
Hangul
ValueCountFrequency (%)
130
50.0%
130
50.0%

합계
Real number (ℝ)

HIGH CORRELATION 

Distinct116
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1038.7231
Minimum5
Maximum49973
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T23:10:06.471312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile397.5
Q1532.5
median631.5
Q3766.5
95-th percentile1055.95
Maximum49973
Range49968
Interquartile range (IQR)234

Descriptive statistics

Standard deviation4330.4459
Coefficient of variation (CV)4.1690091
Kurtosis129.343
Mean1038.7231
Median Absolute Deviation (MAD)111
Skewness11.358924
Sum135034
Variance18752762
MonotonicityNot monotonic
2023-12-12T23:10:06.641489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
491 3
 
2.3%
783 2
 
1.5%
561 2
 
1.5%
743 2
 
1.5%
629 2
 
1.5%
388 2
 
1.5%
651 2
 
1.5%
994 2
 
1.5%
660 2
 
1.5%
615 2
 
1.5%
Other values (106) 109
83.8%
ValueCountFrequency (%)
5 1
0.8%
66 1
0.8%
130 1
0.8%
335 1
0.8%
388 2
1.5%
393 1
0.8%
403 1
0.8%
414 1
0.8%
432 1
0.8%
433 1
0.8%
ValueCountFrequency (%)
49973 1
0.8%
1492 1
0.8%
1420 1
0.8%
1198 1
0.8%
1121 1
0.8%
1063 1
0.8%
1060 1
0.8%
1051 1
0.8%
994 2
1.5%
978 1
0.8%

선사회원
Real number (ℝ)

Distinct33
Distinct (%)25.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean138.74615
Minimum0
Maximum15703
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T23:10:06.815758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median6
Q313
95-th percentile30.55
Maximum15703
Range15703
Interquartile range (IQR)10

Descriptive statistics

Standard deviation1378.655
Coefficient of variation (CV)9.9365274
Kurtosis128.84787
Mean138.74615
Median Absolute Deviation (MAD)4
Skewness11.329523
Sum18037
Variance1900689.5
MonotonicityNot monotonic
2023-12-12T23:10:06.973781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
3 16
 
12.3%
4 13
 
10.0%
2 11
 
8.5%
6 9
 
6.9%
1 9
 
6.9%
10 8
 
6.2%
5 7
 
5.4%
9 5
 
3.8%
8 5
 
3.8%
13 4
 
3.1%
Other values (23) 43
33.1%
ValueCountFrequency (%)
0 1
 
0.8%
1 9
6.9%
2 11
8.5%
3 16
12.3%
4 13
10.0%
5 7
5.4%
6 9
6.9%
7 4
 
3.1%
8 5
 
3.8%
9 5
 
3.8%
ValueCountFrequency (%)
15703 1
0.8%
1032 1
0.8%
154 1
0.8%
37 2
1.5%
36 1
0.8%
31 1
0.8%
30 1
0.8%
28 1
0.8%
27 1
0.8%
25 1
0.8%

일반회원
Real number (ℝ)

HIGH CORRELATION 

Distinct118
Distinct (%)90.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean899.97692
Minimum5
Maximum34270
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T23:10:07.145275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile383.5
Q1522.75
median620.5
Q3753.25
95-th percentile1016.85
Maximum34270
Range34265
Interquartile range (IQR)230.5

Descriptive statistics

Standard deviation2956.2845
Coefficient of variation (CV)3.2848447
Kurtosis128.77088
Mean899.97692
Median Absolute Deviation (MAD)107
Skewness11.321542
Sum116997
Variance8739617.8
MonotonicityNot monotonic
2023-12-12T23:10:07.337777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
525 3
 
2.3%
632 2
 
1.5%
1038 2
 
1.5%
664 2
 
1.5%
599 2
 
1.5%
576 2
 
1.5%
717 2
 
1.5%
667 2
 
1.5%
645 2
 
1.5%
522 2
 
1.5%
Other values (108) 109
83.8%
ValueCountFrequency (%)
5 1
0.8%
64 1
0.8%
126 1
0.8%
325 1
0.8%
369 1
0.8%
370 1
0.8%
379 1
0.8%
389 1
0.8%
411 1
0.8%
424 1
0.8%
ValueCountFrequency (%)
34270 1
0.8%
1404 1
0.8%
1170 1
0.8%
1114 1
0.8%
1052 1
0.8%
1038 2
1.5%
991 1
0.8%
989 1
0.8%
962 1
0.8%
930 1
0.8%

Interactions

2023-12-12T23:10:05.396572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:04.823324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:05.110348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:05.494969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:04.908475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:05.200028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:05.620104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:05.002651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:10:05.290300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:10:07.445664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
합계선사회원일반회원
합계1.0000.6960.696
선사회원0.6961.0000.696
일반회원0.6960.6961.000
2023-12-12T23:10:07.545358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
합계선사회원일반회원
합계1.0000.1930.963
선사회원0.1931.0000.111
일반회원0.9630.1111.000

Missing values

2023-12-12T23:10:05.756720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:10:05.853591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회원가입 연월합계선사회원일반회원
02022년 12월7656759
12022년 11월5641563
22022년 10월7554751
32022년 9월5703567
42022년 8월8005795
52022년 7월6391638
62022년 6월7912789
72022년 5월6643661
82022년 4월80910799
92022년 3월7673764
회원가입 연월합계선사회원일반회원
1202012년 12월61510605
1212012년 11월5879578
1222012년 10월47937442
1232012년 9월33510325
1242012년 8월6166610
1252012년 7월14921032460
1262012년 6월499731570334270
1272012년 5월1304126
1282012년 4월66264
1292012년 3월505